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BRIEF DESCRIPTION OF SEQUENCE LISTING 



SEQ ID NO: 


Description 


1 


Draft genome assembly from The Genome Science Center in British Colombia, 
Canada of sequence from TOR2 isolate. T0R2jiraft j>enomejjssembly 120403 Release 1 


2 


CDC SARS-CoV strain sequence. Entire nucleotide sequence (Urbani strain) 


3-20 


Group-specific coronavirus gene products 

> Feline infectious peritonitis virus (FBPV) 

3/4 = ORF 3b; 5/6 = ORF 3X; 7/8 = ORF 3A 

> Canine coronavirus 

9/10 = ORF 7b; 1 1/12 = ORF 7a 

> Avian infectious bronchitis virus 

13/14 = ORF 5b; 15/16 = ORF 5a; 17/18 = ORF 3a; 19/20 = ORF 3b 


21-520 


500 primers for left part 


521-1020 


500 primers for right part 


1021-3520 


Forward primers from Table 4 


3521-6020 


Reverse primers from Table 4 


6021-6026 


Figure 9 primers 


6027-6033 


Figure 11 primers 


6034-6038 


Five primers from http://coitteiit.nejm.org/cgUreprint/NEJMoa030781v2.pdf 


6039-6051 


PEP1 to PEP13 


6052 


Extended PEP13 


6053-6056 


229E human coronavirus sequences 


6057-6060 


TGV sequences 


6061-6064 


PEDV sequences 


6065-6068 


Bovine coronavirus sequences 


6069-6071 


Murine hepatitis virus sequences 


6072-6075 


AJDBV sequences 


6076-6170 


Primer sequences (forward) 


6171-6265 


Primer sequences (reverse) 


6266-6304 


Primer sequences (forward) 


6305-6343 


Primer sequences (reverse) 


6344-6366 


Primer sequences (forward) 


6367-6392 


Primer sequences (reverse) 


6393-6440 


Primer sequences (forward) F1-F48 


6441-6487 


Primer sequences (reverse) R1-R47 


6488-6559 


Primer sequences 


6560-6568 


Primer sequences 


6569 


The nsp2 proteinase (3CL-PRO) sequence in SARS coronavirus 


6570-72 


The nsp2 proteinases (3CLp) of avian IB V, MHV, and BCoV 


6573 


Consensus nsp2 proteinases sequence 


6574-6577 


IG sequences from Figure 18 


6578 


Expression construct of nSh in pCMVIII 


6579 


Expression construct of nS in pCMVIII 


6580 


Expression construct of nSh ATC in pCMVIII 


6581 


Expression construct of nS ATC in pCMVIII 


6582 


Expression construct of nSlh in pCMVIII 


6583 


Expression construct of nSl in pCMVIII 


6584-6585 


Primers for cDNA amplification 


6585-6587 


Primers for RT-PCR 


6588-6809 


Component sequences of Figure 23 (>4 amino acids) 


fZQ in *71 ^70 

oolU-/l/y 


Component sequences of Figure 24 (>4 amino acids) 


7180-7187 


N-glycosylation sites within SEQ ID NO: 6039 


7188-7189 


Component sequences of Figure 25 


7190 


Fragment of SEQ ID NO: 7188 


7191 


Polynucleotide encoding SEQ ID NO: 7190 


7192 


Amino acids 879-1005 of SEQ ID NO: 6042 


7193 


Amino acids 879-980 of SEQ ID NO: 6042 
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7194 


Amino acids 901-1 005 of SFO TO NO- 604.9 


7195 


Amino acids 1 144-1901 nf SFO TD NO* 604.9 


7196 


Amino acids 1144-1 106 nf SFO TD NO- 604.9 


7197-7199 


IVIembrane fusion npntirlp rpaions 


7200-7206 


INadA-based nolvnentiripQ 


7207-7223 


N-glvcosvlation sites within SFO TT) NO- 6049 


7224-7231 


Slippage region 


7232 


Orflab oolvnrotein 


7233-7244 


Orflab Dolvnrofpins 


1 7245-7247 


X? seauences for SFO TD NOS 79 ^ "3-79 4.4. 


7248-7253 


Orflab oolvnroteins 


7254 


Zinc binding region 9 ^itp 


7255-7271 


N-slvcosvlation <iitp<2 in SFO TP) NO<5* 604.0 A1 &c\An /^o/ic /t^ rncn n 

_ fexj w\jojfxatx\^XA MICj 111 OJCrV^ 1LJ IN WO. OU^U-^x,0U^J,QU4O-40,OUjU-,5 1 


7272-7291 


PolvoeD tides and nolvnnplpnti Hpq 


7292-7293 


Inter&enic ^ennpnppQ 


7294-7301 


NucleOtidp*? from *V fnH r\f Q APQV (YAnnmo -f>\11/-iiT» a ^l u,. .: * 

x^u^icuiiucb iium j ena 01 oako v genome roilowea by mtergemc sequence 


7302-7306 


NadA constructs 


7307-7308 


Fragments of SFO TD NO- 604.9 


7309 


NadA sea uen pp 


7310-7311 


NadA leader spfinenpf^ 


7312-7315 


Amino acid seaiipnpp<3<5 from NnrIA 

* **"*"v uvxu ljuu uvUvC/jo XX will liaUxV 


7316-7324 


PCR primers 


7325-7330 


Primers 


7331 


CCACC seauence 


7332-7336 


3' UTR forward nrimprs 


7337-7341 


3 UTR reverse nrimprs 


7342-7352 


3 f UTR probes " " "~ " — : 


7353-7362 


5' UTR forward nrimers 


7363-7373 


5 UTR reverse primers 


7374-7385 


5' UTR probes — — 


7386 


Conserved octannplpntiHp 

> — viwvi y wu vywi>cixxu.^xt/VJLXVX& 


7387 


Reverse comnlempnt nf WO TT» NTO* 79Q'3 

J.W v v^i l)v C>*JXXX|w»At^XXXC?XlL \JL \J±Jj\J I U J>1_7, / / .J 


7388 


Intersenic seonenpp 

aa.i.i.wi gviuv OvWJ UwUvv 


7389 


Poly T — _ _ _ 


7390 


Stem-loon sennenpp 


7391-7392 


Polv-fflvcine linkers 

*• wxjr J wj. JLIV^ XXXXXVwl O 


7393 


Polv-histidine tap - 


7394 


Nucleocansid enitoTip Qitp 


7395 


Anti sense primer 


7396-7397 


Probes 


7398-7399 


Antigenic fraffments of SFO TD NO- 604.9 

4 uiug^iuv/ xx txgxxxv^xiLo Ul vjIjV^ Xx-/ Dv/^r^ 


7400-7639 


T-eoitone analvsis of SFO in NO* 60^0 


7640-7800 


T-eDxtone analvsis of wSFO TD NO* 604.0 


7801-8040 


T-enitooe analvsis of vSFO TD NO- 604.1 


8041-8280 


T-eoitone analvsis of SFO ID NO* 604.9 


8281-8486 


T-enitone analvsis of vSFO I n i\ro» 604.^ 


8487-8665 


T-eoitone analvsis of SFO in NO* 604.4 


8666-8820 


T-enitODe analvsis of 5?FO in NO* 604.*^ 


8821-9018 


T-enitODe analvsis of SFO I n NO* 604.6 


9019-9131 j 


T-eoitooe analvsis of SFO m NO* 604.7 


9132-9308 


T-eDitooe analvsis of SFO in NO* 604.& 


9309-9437 


T-enitODe analvsis of SFO m NO* 604.0 


9438-9538 


T-eoitooe analvsis of SFO m NO* 60*^n 


9539-9752 


T-epitope analysis of SEQ ID NO: 6052 


9753-9763 


Primers for spike protein amplification, particularly fragments of spike 


9764-9765 


N-glycosylation sites within SEQ ID NO: 6039 


9766-9779 


Cleavage products for ORFlab (Table 10) 
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9780-9782 


Forward primer, reverse primer, probe 


9783-9784 


Lysine-rich region 


9785-9798 


Oligonucleotides used for S.cerevisiae expression 


9799-9802 


Sequences from Figures 65 & 66 


9803-9882 


Primers for E.coli cloning 


9883-9885 


BCV nucleotide sequences for Figures 3A, 3B, 3C 


9886-9891 


BCV amino acid sequences for Figures 4A, 4B, 4C, 4D, 4E, 4F 


9892 


BCV 5* UTR 


9893 


BCV 3' UTR 


9894-9896 


MHV nucleotide sequences for Figures 3A, 3B, 3C 


9897-9902 


MHV amino acid sequences for Figures 4 A, 4B, 4C, 4D, 4E, 4F 


9903-9904 


AIB V nucleotide sequences for Figures 3A, 3B 


9905-9909 


AIBV amino acid sequences for Figures 4A, 4B, 4D, 4E, 4F 


9910 


AIB V 5' UTR 


9911 


AIBV 3' UTR 


9912-9913 


HOBMPRO, HOBHEGA nucleotide sequences for Figures 3B 3C 


9914-9918 


Human CoV amino acid sequences for Figures 4 A, 4B, 4C 4E 4F 


9919 


HCoV-OC43 5' UTR 


9920 


HCoV-OC43 3' UTR 


9921-9923 


pCMVKm2 vectors 


9924-9926 


Codon-optimised N, M and E sequences 


9927 


BNI-1 


9928-9959 


Constituent amino acid sequences >4aa inferred from SEQ ID NO* 9927 


9960 


ORFlab variant 


9961 


ORFla variant 


9962 


Spike variant 


9963 


Membrane variant 


9964 


Nucleocapsid variant 


9965-9966 


Short ORFs 


9967 


J^RA complete genome 
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CLAIMS 

1 . An isolated polypeptide of the S ARS virus. 

2. The polypeptide of claim 1, wherein the polypeptide is a Spike (S) polypeptide, an Env 
(E) polypeptide, a Membrane (M) polypeptide, a hemagglutinin-esterase polypeptide (HE), a 

5 nucleocapsid (N) polypeptide, a ORFla polypeptide, a ORFlab polypeptide, a proteolytic 
fragment of a ORFla polypeptide, or a proteolytic fragment of a ORFlab polypeptide. 

3. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO s : 6039, 7232, 9766, 9767, 9768, 9769, 9770, 
9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 6042, 6043, 6044, 6045, 6046, 6047, 

10 6048, 6049, 6050 or 6052. 

4. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
having >75% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO s : 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 

9769, 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 
15 10338, 10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 

10577, 10578, 10579, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

5. The polypeptide of claim 1, wherein the polypeptide comprises a fragment of at least 10 
consecutive amino acids of an amino acid sequence selected from the group consisting of SEQ 
ID NO s : 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 

20 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 
10578, 10579, 11552, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

6. A polypeptide comprising an amino acid sequence having >80% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO s : 6042, 6043, 6044, 

25 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 9770, 9771, 9772, 9773, 
9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 10339, 10340, 10341, 
10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 10578, 10579, 11552, 
11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

7. A polypeptide comprising an amino acid sequence that comprises a fragment of at least 
30 10 consecutive amino acids of an amino acid sequence selected from the group consisting SEQ 

ID NO s : 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 

9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 
10578, 10579, 11552, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 
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8. A polypeptide comprising an amino acid sequence having >80% sequence identity to 
SEQ ID NO: 6042, and/or comprising an amino acid sequence that comprises a fragment of at 
least 10 consecutive amino acids of SEQ ID NO: 6042, wherein the polypeptide is in the form of 
a trimer. 

5 9. Nucleic acid encoding the polypeptide of any one of claims 1 to 8. 

10. Nucleic acid according to claim 9, comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO s : 7191, 7273, 7275, 7277, 7279, 7281, 7283, 7285, 7287, 7289, 
7291, 7292, 7293, 9968, 10066, 10084, 10299, 10505, 11323, 11563, 11639 & 11640. 

11. A polynucleotide comprising a nucleotide sequence having >80% sequence identity to 
10 the nucleic acid of claim 9 or claim 10. 

12. A polynucleotide comprising a fragment of at least 10 consecutive nucleotides of the 
nucleic acid of claim 9 or claim 10. 

13. Antibody that recognizes the polypeptide of any one of claim 1 to 8. 

14. The antibody of claim 13, wherein said antibody recognizes the polypeptide comprising 
15 the amino acid sequence of SEQ ID NO: 6042 or a fragment thereof. 

15 . The antibody of claim 14, wherein said antibody recognizes the polypeptide comprising 
the amino acid sequence of SEQ ID NO: 6042 or a fragment thereof in trimeric form. 

16. The antibody of claim 13, wherein the antibody is a monoclonal antibody, 

17. The antibody of claim 13, wherein the antibody is a human antibody, 

20 18. An immunoassay for detecting a SARS virus antigen in a sample, comprising the step of 
contacting the sample with the antibody of any one of claims 13 to 17. 

19. An immunoassay for detecting an antibody against a SARS virus antigen in a sample, 
comprising the step of contacting the sample with the polypeptide of any one of claims 1 to 8. 

20. A method of detecting an antibody against a SARS virus antigen in a sample comprising 
25 contacting said sample with the polypeptide of any one of claims 1 to 8, under conditions 

suitable for binding said polypeptide to said antibody, if present, and detecting the binding of 
said polypeptide to said antibody. 

21. A method for detecting a SARS virus antigen in a sample comprising contacting said 
sample with the antibody of any one of claims 13 to 17, under conditions suitable for binding 

30 said antibody to said antigen, if present, and detecting the binding of said antibody to said 
antigen. 
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22. A vaccine for the treatment or prevention of severe acute respiratory syndrome (SARS), 
comprising an inactivated SARS virus, a killed SARS virus, an attenuated SARS virus, a split 
SARS virus preparation, or at least one purified SARS virus antigens. 

23. The vaccine of claim 22, comprising a purified polypeptide according to any one of 
5 claims 1 to 8. 

24. The vaccine of claim 22 or claim 23, wherein the antigen is a purified SARS virus 
antigen in the form of a VLP. 

25. The vaccine of any one of claims 22 to 24, further comprising an adjuvant. 

26. The vaccine of claim 25, wherein the adjuvant is an aluminium salt or is MF59. 

10 27. The vaccine of any one of claims 22 to 26, comprising more than one SARS virus 
antigen. 

28. The vaccine of claim 27, wherein the antigens are selected from S, E, N and M. 

29. The vaccine of claim 22, comprising an inactivated SARS virus. 

30. The vaccine of claim 29, wherein said virus is inactivated by chemical or physical means. 
15 31. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 

an effective amount of one or more of the following agents selected from the group consisting of 
detergents, formaldehyde, formalin, P-propriolactone, and UV light. 

32. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 

20 methylene blue, psoralen and carboxyfullerene (C60). 

33. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 
binary ethylamine, acetyl ethyleneimine and gamma irradiation. 

34. The vaccine of claim 31, wherein said inactivation comprises treatment with P- 
15 propriolactone. 

35. The vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 0.01 
to 0.5%. 

36. The vaccine of claim 34, wherein said p-propriolactone is used at a concentration of 0.5 
to 0.2%. 

50 37. The vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 
0.025 to 0.1%. 



-637- 



WO 2004/092360 



PCT/US2004/011710 



38. A method of inactivating SARS virus comprising exposing the virus to an inactivation 
agent for 12 to 24 hours at refrigeration temperatures followed hydrolysis of any residual 
inactivating agent by elevating the temperature for three hours. 

39. The method of claim 38, wherein the inactivation agent is P-propriolactone. 

40. The method of claim 38, wherein the refrigeration temperature is between 0°C and 8°C. 

41. The method of claim 38, wherein the elevated temperature is between 33°C and 41°C. 

42. A method for making an inactivated SARS vaccine comprising: 

a. innoculating a mammalian cell culture with SARS virus; 

b. cultivating the infected cells; 

c. harvesting SARS virus containing supernatant; 

d. inactivating the SARS virus; and 

e. purifying the inactivated SARS virus. 

43. The method of claim 42, wherein said mammalian cell culture is derived from one or 
more of the cell types selected from the group consisting of fibroblast cells, endothelial cells, 
hepatocytes, keratinocytes, immune cells, mammary cells, smooth muscle cells, melanocyte 
cells, neural cells, prostate cells, renal cells, skeletal cells, liver cells, retinoblast cells and 
stromal cells. 

44. The method of claim 42, wherein said mammalian cell culture is derived from a cell 
culture selected from the group consisting of human cells, non-human primate cells, HeLa cells, 
human diploid cells, fetal rhesus lung cells, human embryonic kidney cells, VERO cells, horse 
cells, cow cells, sheep cells, dog cells, cat cells or rodent cells. 

45. The method of claim 42, wherein said mammalian cell culture is derived from VERO 
cells or fetal rhesus kidney cells. 

46. The method of claim 42, wherein said mammalian cells are cultured in serum free media. 

47. The method of claim 42, wherein said mammalian cells are cultured in protein free 
media. 

48. The method of claim 42, wherein said inoculating step comprising absorbing the SARS 
virus onto the cell culture for 60 to 300 minutes. 

49. The method of claim 42, wherein said inoculating step is conducted at 25°C to 40°C. 

50. The method of claim 42, wherein said purification step comprises one or more of the 
treatments selected from the group consisting of gradient centrifugation, ultracentrifugation, 
continuous-flow ultracentrifugation, chromatography, polyethylene glycol precipitation, and 
ammonium sulfate precipitation. 
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51. The method of claim 42, wherein said purification step comprises one or more of the 
treatments selected from the group consisting of ultrafiltration and dialfiltration. 

52. The method of claim 50, wherein said chromatography treatment includes one or more of 
the chromatography treatments selected from the group consisting of ion exchange 

5 chromatography, size exclusion chromatography, and liquid affinity chromatography. 

53. The method of claim 52, wherein said chromatography treatment includes use of one 
more chromatographic resins selected from the group consisting of an an anionic resin and a 
cationic resin. 

54. The method of claim 52, wherein the ion exchange chromatography treatment includes a 
10 first step using a strong anion exchange resin and a second step using a strong cation exchange 

resin. 

55. The method of claim 50, wherein said gradient centrifugation purification step comprises 
density gradient centrifugation. 

56. The method of claim 42, wherein said purification step comprises a first step of 
15 chromatography purification and a second step of gradient centrifugation. 

57. The method of claim 56, wherein said first chromatography purification step comprises 
liquid affinity chromatography. 

58. The method of claim 56, wherein said second gradient centrifugation step comprises 
density gradient centrifugation. 

20 59. A single-stranded oligonucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 
7332-7352, 7353-7385, 10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

60. A single-stranded oligonucleotide comprising the complement of the oligonucleotide of 
claim 59. 

25 61. The oligonucleotide of claim 59 or claim 60, comprising 10-30 nucleotides. 

62. The oligonucleotide of claim 61, comprising the nucleotide sequence of SEQ ID NO: 
7292, SEQ ID NO: 7293, the complement of SEQ ID NO: 7292 or the complement of SEQ ID 
NO: 7293. 

63. A kit comprising primers for amplifying a template sequence contained within a SARS 
30 virus nucleic acid target, the kit comprising a first primer and a second primer, wherein the first 

primer comprises a sequence substantially complementary to a portion of said template sequence 
and the second primer comprises a sequence substantially complementary to a portion of the 
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complement of said template sequence, wherein the sequences within said primers which have 
substantial complementarity define the termini of the template sequence to be amplified. 

64. The kit of claim 63, wherein the template sequence is contained within SEQ ID NO: 1 
and/or SEQ ID NO: 2. 

5 65. The kit of claim 63 or claim 64, wherein the first primer comprises a fragment of 8 or 
more nucleotides of SEQ ID NO: 1, and the second primer comprises a fragment of 8 or more 
nucleotides of the complement of SEQ ID NO: 1. 

66. The kit of claim 63 or claim 64, wherein the first primer comprises a fragment of 8 or 
more nucleotides of SEQ ID NO: 2, and the second primer comprises a fragment of 8 or more 

10 nucleotides of the complement of SEQ ID NO: 2. 

67. The kit of claim 63, wherein the first primer is an oligonucleotide according to any one of 
claims 59 to 62 and the second primer is an oligonucleotide according to any of claims 59 to 62. 

68. The kit of any one of claims 63 to 67, further comprising a labeled probe that comprises 
either a fragment of 8 or more nucleotides of SEQ ID NO: 1 and/or SEQ ID NO: 2, or the 

15 complement of said fragment, which fragment is located within the template sequence. 

69. , The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS: 21-6020, 
6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 10235-10298, 10352- 
10504, 10580-11322 and 11325-11551. 

10 70. The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises the complement of a nucleotide sequence selected from the group consisting of SEQ 
ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 
10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

71. A method of detecting the presence of SARS virus in a sample comprising providing a 

5 sample suspected of containing a SARS virus nucleic acid target, amplifying a template sequence 
contained within said SARS virus nucleic acid target with the kit of any one of claims 63 to 70, 
and detecting the amplified template sequence, wherein the presence of the amplified template 
sequence indicates the presence of SARS virus in said sample. 

72. The method of claim 7 1 , wherein said amplifying is accomplished using polymerase 
0 chain reaction, transcription mediated amplification, reverse transcription PCR, ligase chain 

reaction, strand displacement amplification or nucleic acid sequence-based amplification. 

73. A double-stranded RNA molecule with a length from about 10 to about 30 nucleotides 
which is able to inactivate the SARS coronavirus in a mammalian cell. 
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74. The double-stranded RNA of claim 73, wherein the sequence of one of the strands is at 
least 90% identical to a target sequence, wherein the target sequence is a fragment of SEQ ID 
NO: 1 and/or SEQ ID NO: 2. 

75. The double-stranded RNA of claim 73 or claim 74, wherein the target sequence 

5 comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS: 7292, 
7293, 7294, 7295, 7296, 7297, 7298, 7299, 7300 and 7301. 

76. The double-stranded RNA of any one of claims 73 to 75, comprising at least one 
modified nucleotide. 

77. A method for treating a patient suffering from SARS, comprising: administering to the 
10 patient a therapeutically effective dose of a molecule of less than 1000 g/mol. 

78. The method of claim 77, wherein the molecule has an aromatic region and greater than 
one heteroatom selected from O, S, or N. 

79. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a therapeutically effective dose of a compound selected from: a nucleoside analog, a 

15 peptoid, an oligopeptide, a polypeptide a protease inhibitor, a 3C-like protease inhibitor , a 
papain-like prqtease inhibitor, or an inhibitor of an RNA dependent RNA polymerase. 

80. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a steroidal anti-inflammatory drug in combination with at least one antiviral compound. 

81. A method for treating a patient suffering from SARS, comprising: administering to the 
20 patient a therapeutically effective dose of a compound selected from: acyclovir, gancyclovir, 

vidarabidine, foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine, 
didanosine, zalcitabine, an antiviral compound listed in Table 1; an antiviral compound listed in 
Table 2; or an interferon. 

82. The method of claim 81, wherein the interferon is an interferon-ct or an interferon-p. 

25 83. The method of any one of claims 77 to 82, wherein the molecule or compound is 
delivered by inhalation. 

84. A method of identifying a therapeutically active agent comprising the steps of: (a) 
contacting a therapeutically active agent with a cell infected with the SARS virus; (b) measuring 
attenuation of a SARS related enzyme. 

30 85. A viral vector or particle for in vivo delivery of a nucleic acid of claim 9 or claim 10. 

86. The viral vector of claim 85, wherein the vector is an adenovirus vector, a poxvirus 
vector or an alphavirus vector. 

87. An alphavirus replicon particle comprising one or more SARS viral antigens. 
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88. The replicon particle of claim 87, wherein said SARS viral antigen is a spike protein. 

89. The replicon particle of claim 87, wherein said particle comprises a replicon derived from 
Venezuelan Equine Encephalitis (VEE) and further comprises an envelope derived from Sindbus 
virus (SIN) or Semliki Forest Virus (SFV). 

5 90. A vaccine comprising one or more SARS virus antigens and one or more respiratory 
virus antigens. 

91. The vaccine of claim 90, wherein said respiratory virus antigens are selected from the 
group consisting of influenza virus, human rhinovirus (HRV), parainfluenza virus (PIV), 
respiratory syncytial virus (RSV), adenovirus, metapneumo virus, and rhinovirus. 

10 92. The vaccine of claim 91, wherein said respiratory virus antigen is from influenza virus. 

93. The vaccine of claim 90, wherein said respiratory virus antigen is from a coronavirus 
other than the SARS virus. 

94. A polypeptide comprising an immunogenic, surface exposed fragment of the amino acid 
sequence SEQ ID NO: 6042. 

15 95. The polypeptide of claim 94, wherein said fragment does not include the last 50 amino 
acids of the C-terminus of SEQ ID NO: 6042. 

96. The polypeptide of claim 94, wherein said fragment does not include a transdomain 
region of SEQ ID NO: 6042. 

97. The polypeptide of claim 94, wherein said fragment does not include a C-terminus 
20 cytoplasmic domain of SEQ ID NO: 6042. 

98. The polypeptide of claim 94, wherein said fragment does not include a N-terminus signal 
sequence. 

99. An isolated polynucleotide comprising a nucleic acid sequence selected from the group 
consisting of SEQ ID NOS: 9968 and 10066. 

25 100. The polynucleotide of claim 99, wherein the polynucleotide comprising a nucleic acid 
sequence having > 80% sequence identity to a polynucleotide sequence selected from the group 
consisting of SEQ ID NOS: 9968 and 10066. 

101. An isolated polynucleotide comprising a fragment of at least 15 consecutive nucleic acids 
of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 9968 and 10066 

30 and wherein said fragment does not consist entirely of SEQ ID NO: 10033. 

102. An isolated polypeptide comprising an amino acid sequence encoded by any one of 
claims 99-101. 
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103. The polypeptide of claim 102, comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 9969 - 10032, 10067, and 10015. 

104. The polypeptide of claim 103, wherein the amino acid sequence is selected from the 
group consisting of SEQ ID NOS: 9997, 9998 and 10015. 

5 105. An expression construct for recombinant expression of a SARS virus spike protein 

wherein said construct comprises a nucleic acid sequence selected from the group consisting of 
SEQ ID NOS: 6578 - 6583. 

106. A mammalian cell line stably expressing a SARS viral antigen. 

107. The cell line of claim 106, wherein said cell line is a Chinese Hamster Ovary (CHO) cell. 

10 108. The cell line of claim 106, wherein the SARS viral antigen is a spike protein or fragment 
thereof. 

109. The cell line of claim 106, wherein the spike protein is truncated to remove the 
transmembrane sequence. 

1 10. A method of identifying a therapeutically active agent comprising the steps of: (a) 
15 contacting a therapeutically active agent with a buffer comprising SARS enzyme; and (b) 

measuring attenuation of the SARS enzyme. 

111. The method of claim 110 wherein the SARS enzyme is a SARS protease. 

1 12. The method of claim 111 wherein the buffer further comprises a peptide with a SARS 
protease cleave site. 

20 113. The method of claim 110 wherein the measurement is made by the measurement of 
fluorescence. 

1 14. A vaccine of one of claims 22 to 37, and 90 to 93 further comprising an adjuvant. 

115. The vaccine of claim 1 14 wherein the adjuvant is a SMTP. 

116. The vaccine of claim 115 wherein the SMTP compound is selected from the group 
25 consisting of an acylpiperazine, a tryptanthrin, an indoledione, a tetrahydroisoquinoline, a 

benzocyclodione, an amino azavinyl compound, a thiosemicarbazone, a lactam, an 
aminobenzimidazole quinolinone, a hydropthalamide, a benzophenone, an isoxazole, a sterol, a 
quinazolinone, a pyrole, an anthraquinone, a quinoxaline, a triazine, an benzazole, and a 
pyrazolopyrimidine, or a pharmaceutically acceptable salt, ester, or prodrug thereof. 

30 1 17. A method of vaccinating a subject comprising administering a vaccine of one of claims 
22 to 37, and 90 to 93. 

118. The method of claim 1 17 further comprising administering a SMIP. 
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119. A method for treating a patient of one of claims 77 to 82 further comprising 
administering at least one SMIP compound. 

120. A method for treating a patient of one of claims 77 to 82 further comprising 
administering at least one SMIS compound. 
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; Section 18 
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BCV N (906) GT TTTTCTTT 30P TC AAGA'r l~AGAG1 TGG 5c AA S GTGCAGAAT 
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-Section 26 
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Avian infecUous bronchitis N (986) CACGTCGAA^ 
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-Section 32 
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FIGURE 3B (contd.) 



— Section 7 
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BCV M (212) T T.TT CAATTGCGTG 1' A TGCGTT G A 7v TAATGT GTAT C^i^fec 
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Consensus (247) TTTTTAATTGCOTAT^ 

— — — Section 8 
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BCV M (253) ^TTCTATAGTTTTCAC.TAT AGl'GGCCATT a I CATGTGGAT 

MHV M (256) ttttctat;a^tgtti7\ctai'agtgtccattataatgtggat 
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Consensus (288) ttttctataqtttttactataot atgtggat 
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Avian infectious brochitis virus M (424) cgT^ci- - --- C C SAT AA AG A A T isid AGT 3«C $ T^'^S'T'G'i'' 
Consensus (493) AT ctttacatgcaaggtataaagctaggtactggctattct 



-Section 14 

(534) 534 ,540 550 560 574 
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FIGURE 3C 
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(1) 1 _ 4 10 _ 20 ,30 , ,40 . 63 
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Consensus (1) ctaaactcagtgaaaatgtttttgcttc tagattt ttctagttagctgcat 

__________ Section 2 
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HOBHEGA (54) AT^G G *i? A Gj C*TpF A GGT T T T,T ^CAAC 6 C X C C I* A C C A A X G X T G ^ T T C G C AX G T A A 
BGV HE (54) 'SATTGGTAGC CT AG GTT T T.G A C A AT C CTC.C TAG CAATGTTGTTTCGGATTTAA 

MHV HE (1 ) - - - « 

Consensus (54) AATTGGTAGC XAGGXTTT ACAA CCTCCTACGAATGTTGTT TCGCAT TAA 

— - - - ■ — - — ~ — — — Section 3 

(107) 107 ,120 ,130 M9 159 

HOBHEGA (107) ^^^^^^^^^^^^^^^^^^P^P^^^^^^^PI^^P^^ 
BGV HE (107) AT G Q Ap A T,T G G,TX TTTATTTGGTG A C AG T CGTTC AG A XT GT A A X C A TG T TGTT 

MHV HE (1) - ■ « .._»--. 

Consensus (107) atggagattqgtttttatttggtgacagtcgttcagattqtaatcat ttgtt 

- ' — — — — — Section 4 
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HOBHEGA (160) l&3l&^ 

BCVHE (iaO)icTACCAACCCGCGTAATTATTCTTATATGGACCTTAATCCTGCCTTGTGT|G 

MHV HE fl) - - - 

Consensus (160) a ta caaccccc taattattcttatatggaccttaatcctg , tgtgtg 
_ _____ Section 5 

(213) 213 ggQ 232 £40 gm 285 
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Consensus (213) ttctggtaaaatatcatctaaagctggcaactccatxtttaggagttxtcact 

. ______ Section 8 

(288) 266 _ j28Q_ _ _ J90 _ 300 318 

HOBHEGA (286) fx ACCG ATTTTT ATAATTAcScAGGCGAAGGTCAACAAAT r A T tIo T T T A T G A G 
BGV HE (266) TTACCGATTXTT ATAATT ACACAGGC GAAGGTCAACAAATTATTTT CT ATGAG 

MHV HE (1) — ■ - ^ : - 

Consensus (266) TTA C CG ATTTTTATAAT T AC AC AGGCGAAGGT C AACAAATT AT TTT TATGAG 

— Section 7 
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HOBHEGA (319) &GT GT TA AT,T T 'f AC G c'c T T A X C A T G C'C X X T A A A T § C AA&C G TTCT GG T AG 

BGV HE (319) GGTGTTAATTTTACGCCTTATCATGCCTTTAAATGCXCC ACTTCTQGT AGTA A 

MHV HE (1) — — --- 

Consensus (319) GGTGTTAATTTTACGCCXXATCA'TGCCTXXAAATGCA c TTCTGGTAGTAA 
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FIGURE 3C (contd.) 
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Consensus {1210} DQVF A WKCXE C D LD L K L G L D AM FF Y G DVVSHVCK 

— — — — — Section 33 

(1249) 1240 , .1260 ,1270 1287 
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Consensus (1717) l m kwn vv fkg is y f i f kq am n i^c fx ^ c"i mlq a L L 

. . , , Section 46 

(1 756) 1756 _ ,1770 ,1780 _ 1794 

avian infectious bronchitis pol 1ab (1289) JiJi £ A WAKLLG - F«T PFVAWCY : * C*A&*VGDFS • 

bovine coronavirus pol 1ab(1686) JfcWl ^Qf '&^A WLB Fife? G%P[A P F V s^vi ? KG GFKFGDFA 




Consensus (1786) KFK FQWQEAW EFRSGftp" RFVALVLAkGGFRFGDFSD 

. , — — Section 47 

(1795). 1795 J800 J&VL ,1820 1833 

avian Infectious bronchitis pot 1ab(1327) Slim I#A N || A E H F;|| A D YT Kf A F ||K K R V B jj|jfcif C G I K £ — — 

bovine coronavirus pot 1ab(1725) SjR'D f l rv v f s q v Jd i? t g a x od ir e i ••xkvgvkqeqrtgvda 
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(2068) 2068 2080 2090 2106 
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— — ■ — — Section 59 
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— -*— _____ Section 60 
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Section 64 
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bovine coronavirus pol 1ab (2385) ^UHv^t^M^B^^^^W^^^^i'e x i g r-tfi^yfj* 
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Murine hepatitis pol 1ab (2589) Ifl®^^ ^ 
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FIGURE 4F (contd.) 
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FIGURE 4F (contd.) 
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FIGURE 4F (contd.) 
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FIGURE 5 
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FIGURE 6 

FIGURE 6A 
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FIGURE 6B 
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FIGURE 6C 
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FIGURE 7 



FIGURE 7 A 
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YFPTVQPWFNC IRNDSITOLWTLENLKALYWDYATENITWHRQRL1WVVNGYPYS ITVT 120 

YLPSMNSSSWYCGTGIETASGVHGIFLSYIDSGQGFEIGISQEP FDPSGYQLYLH 107 

TDTVDVTNGLGTYYVLDRVYLNTTLLLNG YY 69 

TETVEVSQGLGTYYVLDRVYLNATLLLTG YY 71 

VYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFG- NP 79 

_ YVYYYQSAFRPPSGWHLQG 46 



IAGCQTTN GLNTSYSVCNG 31 

TTRNFNSAEGAIICICKGSPPTTTTESSLTCNWGSECRLNHKFPICPSNSEMCGmLYG 180 
KATNGNTNAIARLRICQFPDMTLGPTVNDVTTGRNCLFNKAI PAYMRDGKDIWGITWD 167 
PTSGSTYRimLKGTLLLSTLWFKPPFLSDFTNGIFAKVKNTKVIKDGW 12 9 
PVDGSKFRNLALTGTNSVSLSWFQPPYLSQFNDGIFAKVQNLKTSTPSGATAYFPTIVIG 13 1 
VIPFKDGIYFAATEKSNVTOGWVFGSTMNNKSQSV^ 139 
GAYAVVNISSEFNNAGSSSGCTVGIIHGGRVVNASSIAMTAP — 88 



LQWFADEWAYLHGASYRISFENQWSGTVTFGDMRATTLEVAGTLVDLWWFNPVYDVSYY 240 

NDRVTVFADKIYHFYLKNDWSRVATRCYNRRSCAMQYVYTPTYYMLN 214 

STFVNTSYSVWQPHTTILGNKLQGFLEISVCQYTMCEYPNT 171 

SLFGYTSYTWIEPYN GVIMASVCQYTICLLPYT 165 

VS — KPMGTQTHTMIFDNAFN- - - 158 



SEQ ID NO: 6053 CVGYSEWFAVESGGYIPSDFAFMWFLLTNTSSVVDGVVRSF 74 

RVMTKNGTTWS 300 
VTSAGEDGIYYEPCTANCTGYAAWFATDSNGHIPEGFS 274 
ICNPN-LGNQRVELWHWDTGWSCLYKRNFTYDVNADYLYFHFYQEGGTFYAYFTDTGW 230 
DCKPNTNGNKLIGFWHTDVKPPICVLKRNFTLNWADAFYFHFYQHGGTFYAYYADKPSA 225 
-CTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLK 217 
SSGMAWSSSQFCTAHCNFSDTTVFVTHCYKHGG- -CPLTGMLQQN 13 1 



QPLLLNCLWSVSGLRFTTGFVYFNGTGRGDCKGFSSDVLSDVIRYNLNFEENLRRG- — T 131 
QPLLWCLWPVPSFEEAASTFCFEGAGFDQCNGAVLNNTVDVIRFNLNFTTNVQSGKGAT 360 
QPLLVNCLLAIPKIYGLGQFFSFNHTMDGVCNGAAVDRAPEALRFNINDTSVILAEG--S 332 

TKFLFNVYLGTVLSHYYVMPLTCN S ALTLE YWVT PLT S KQ YLLAFNQDGVI FNAVD 286 

TTFLFSVYIGDILTQYYVLPFICNPTAGSTFAPRYWVTPLVKRQYLFNFNQKGVITSAVD 285 
PIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVD 277 



WO 2004/092360 



62/193 



PCT/US2004/011710 



SEQ ID NO: 6072 LIRVSAMKNGQLFYNLTVSVAKYPT^ 191 



SEQ ID NO: 6053 ILFKTSYG VWFYCTNNT-LVSGDAHIPFGTVLGNFYCFVNTTIGNETTSAFVGAL 186 

SEQ ID NO: 6057 VFSLNTTGGVTLEISCYTVSDS-SFFSYGEIPFGVTDGPRYCYVHY NGTALKYLGTL 416 

SEQ ID NO: 6061 IVLHTALG TNLSFVCSNSSDPHLAIFAIPLGATEVPYYCFLKVDTYNSTVYKFLAVL 389 

SEQ ID NO: 6065 CKSDFMSEIKCKTLSIAPSTGVYELNGYTVQPIADVYRRIPNLPDCN-IEAWLNDKSVPS 345 

SEQ ID NO: 6069 CASSYTSEIKCKTQSMLPSTGVYELSGYTVQPVGVVYRRVANLPACN-IEEWLTARSVPS 344 

SEQ ID NO: 6042 CSQNPLAELKCSVKSFEIDKGIYQTSNFRWPSGDVVIl-FPNITNLCPFGEVFNATKFPS 336 

SEQ ID NO: 6072 FKAGGPITYKVMREVKALAYFVNGTAQDVILCDGSPRGLLACQYNTGNFSDGFYPFTNSS 251 



SEQ ID NO: 6053 PKTVREFVISRTGHFYINGYRYFTLGNVEAVNFNVT TAETTDFCTVALASYADVLV 242 

SEQ ID NO: 6057 PPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNLT TGDS DVFWT I AYTS YTEALV 472 

SEQ ID NO: 6061 PPTTOEIVITKYGDVYVNGFGYLHLGLLDAVTINFTGHGTDDDVSGFWTIASTNFVDALI 449 

SEQ ID NO: 6065 PLNWERKTFSNCNFNMSSLMSFIQAYSFTCNNIDAA KIYGMCFSSITIDKFAIPNG 401 

SEQ ID NO: 6069 PLNWERKTFQNCNFNLS SLLRYVQAESLFCNNIDAS KVYGRCFGS ISVDKFAVPRS 400 

SEQ ID NO: 6042 VYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSAT KLNDLCFSNVYADSFWKGD 392 

SEQ ID NO: 6072 LVKQKFIVYRENSVNTTCTLHNFIFHNETGANPNPSG--VQNIQTYQTKTAQSGYYNFNF 309 

• • > : • : 

SEQ ID NO: 6053 NVSQTSIANIIYCNSVINRLRCDQLSFDVPDGFYSTSP--IQSVELPVSIVSLPVYHKHT 300 

SEQ ID NO: 6057 QVENTAITKVTYCNSHVNNIKCSQITANLNNGFYPVSS — SEVGLVNKSWLLPSFYTHT 530 

SEQ ID NO: 6061 EVQGTSIQRILYCDDPVSQLKCSQVAFDLDDGFYPISSRNLLSHEQPISFVTLPSFNDHS 509 

SEQ ID NO: 6065 RKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAANVS — VSRFNPSTWNRRFGFTEQS 459 

SEQ ID NO: 6069 RQVDLQLGNSGFLQTANYKIDTAATSCQLHYTLPKNNVT — INNHNPSSWNRRYGFNDAG 458 

SEQ ID NO: 6042 DVRQIAPGQTGVIADY1TYKLPDDFMGCVLAWNTRNIDAT STGNYNYKYRYLRHG 446 

SEQ ID NO: 6072 SFLS SFVYKESNFMYGS YHPSCKFRLETINNGLWFNSLS VSIAYGPLQGGCKQS 363, 



SEQ ID NO: 6053 FIVLYVDFKPQ-SGGGKCFNCYPAGVNITLANFNET KGPLCVDTSHFTTKYVAVYAN 356 

SEQ ID NO: 6057 IVNITIGLGMKRSGYGQPIASTLS — NITLPMQDHN TDVYCIRSDQFS-VYVHSTCK 584 

SEQ ID NO: 6061 FVNITVS AAFGGLSSANLVASDTTINGFSS FCVDTRQFTITLFYNVTN 557 

SEQ ID NO: 6065 VFKPQPAGVFTDHDWYAQHCFKASTNFCPCKLDGSLCVGNGPGIDAGYKTSGIGTCPAG 519 

SEQ ID NO: 6069 VFGKN- QHDWYAQQCFTVRSSYCPC 483 

SEQ ID NO: 6042 KLRPFER - 453 

SEQ ID NO: 6072 VFKGRAT 370 



SEQ ID NO: 6053 VGRWS ASINTGNCPFSFGKVNNFVKFGSVCFSLKDIPGGCAMPIVA 402 

SEQ ID NO: 6057 SALWDNIFKRNCTDVLDATAVIKTGTCPFSFDKLNNYLTFNKFCLSLSPVGANCKFDVAA 644 

SEQ ID NO: 60 61 SYGYVS KSQDSNCPFTLQSVNDYLSFSKFCVSTSLLAGACTIDLFG 603 

SEQ ID NO: 6065 TNYLTCHNAAQCDCLCTPDPITSKATGPYKCPQTKYLVGIGEHCSGLAIKSDHCG G 575 

SEQ ID NO: 6069 AQPDIVSPCTT QTKPKSAFVNVGDHCEGLGVLEDNCGNADPH 525 

SEQ ID NO: 6042 -- DI SNVPFSPDGKPCTPPALNC YWPLND 480 

SEQ ID NO: 6072 CCYAYSYGGPSLCKGVYSGELDHNFECGL 399 



SEQ ID NO: 6053 NWAYSKY YTIG SLYVSWSDGDGITGVPQPVEGVSSFiyiNVTLDKC 446 

SEQ ID NO: 6057 -RTRTNE QWR SLYVIYEEGDNIVGVPSDNSGVHDLSVLHLDSC 687 

SEQ ID NO: 6061 YPAFGSG VKLT SLYFQFTKGELITGTPKPLEGITDVSFMTLDVC 647 

SEQ ID NO: 6065 NPCTCQP QAFLGWSVDSCLQGDRCNIFANFILHDVNSGTTCSTDLQKSNTDIILGVC 632 

SEQ ID NO: 6069 KGCICAN NSFIGWSHDTCLVNDRCQIFANILLNGINSGTTCSTDLQLPNTEWTGIC 582 

SEQ ID NO: 6042 YGFYTTTGIGY QPYRVWLSFELLNAPATVCGPKLSTDLIKNQC 524 

SEQ ID NO: 6072 LVYVTKS GGSRIQTATEPPVITQNNYNNITLNTC 433 
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SEQ ID NO: 6053 TKYNIYDVSGVGVIRVSNDTFLN GITYTSTSGNLLGFKDVTKGTIYSITPC 497 

SEQ ID NO: 6057 TDYNIYGRTGVGI IRQTNRTLLS GLYYTSLSGDLLGFKNVSDGVIYSVTPC 738 

SEQ ID NO: 6061 TKYTIYGFKGEGIITLTNSSILA GVYYTSDSGQLLAFKNVTSGAVYSVTPC 698 

SEQ ID NO: 6065 VNYDL YG I T GQG I F VE VNAT Y YNS WQNLLYDSNGNLYGFRDYLTNRTFMIRSC 685 

SEQ ID NO: 6069 VKYDLYGITGQGVFKEVKADYYNS WQTLLYDVNGNLNGFRDLTTNKTYTIRSC 635 

SEQ ID NO: 6042 VNFNFNGLTGTGVLTPSSKRFQP - — FQQFGRDVSDFTDSVRDPKTSEILDISPC 576 

SEQ ID NO: 6072 VDYNIYGRTGQGFITNVTDSAVSYNYLADAGLAILDTSGSIDIFWQGEYGLNYYKVNPC 493 

NPPDQLWYQQA — WGAMLSENFTSYGFSNWELPKFFYASNGTYN 542 

DVSAQAAVIDGT- - IVGAITSINSELLGLTHWTTTPNFYYYS IYNYTNDRTRGTAIDSND 796 

SFSEQAAYVNDD- - 1 VGVI S S LSNS — TFNNTRELPGFF YHSNDGSN 741 

YSGRVSAAFHAN — SSEPALLFRNIKCNYVFNNTLSRQLQPINYFDSYLGCVVNADNSTS 743 
YSGRVSAAFHKD — APEPALLYRNINCSYVFSNNISREENPLNYFDSYLGCWNADNRTD 693 
AFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYST6NNVFQTQAGCL 636 
EDVNQQFWSGGK-LVGILTSRNETGSQLLENQFYIKITNGTRRFRRSITEN 544 



-CTDAVLTYS SFGVCADGS -— 1 1 AVQPRNVS YDSVS AIVTAN LSI 584 

VDCEPVITYSNIGVCKNGA FVFINVTHSDGD-VQPISTGN VTI 838 

-CTEPVLVYSNIGVCKSGS IGYVPSQYGQVK- IAPTVTGN ISI 7 82 

SWQTCDLTVGSGYCVDYSTKRRSRRSITTGYRFTNFEPFTVNSVNDSLEPVGGLYEIQI 803 

EALPNCDLRMGAGLCVDYSKSRRADRSVSTGYRLTTFEPYTPMLVNDSVQSVDGLYEMQI 753 

SEQ ID NO: 6042 IGAEHVDTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAI 696 

ANC P YVS YGKFC I KPDGS - 1 AT I VPKQLEQFVAPLFNVTEN VLI 588 



PSNWTTSVQVEYLQITSTPIVVDCSTWCNGNVRCVELLKQYTSACKTIEDALRNSARLE 644 
PTNFTISVQVEYIQVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQTIEQALAMGARLE 898 
PTNFSMS IRTEYLQLYNTPVSVDCATYVCNGNSRCKQLLTQYTAACKTIESALQLSARLE 842 
PSEFTIGNMEEFIQTSSPKVTIDCSAFVCGDYAACKSQLVEYGSFCDNINAILTEVNELL 863 
PTNFTIGHHEEFIQTRSPKVTIDCAAFVCGDNTACRQQLVEYGSFCVNVNAILNEVNNLL 813 
PTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQ 756 
PNSFNLTVTDEYIQTRMDKVQINCLQYVCGSSLDCRKLFQQYGPVCDNILSWNSVGQKE 648 

*. * . k . -k k 

SADVSEMLTFDKKAFTLANVSSFGDYN LSSVIPSLPTSGS 684 

NMEVDSMLFVSENALKLASVEAFNSSETLDPIYKEWPNIGGSWLEGLKYILPSHNS 954 

SVEVNSMLT I SEE ALQLAT I S S FNGDG YNFTNVLGASVYDPASGR 887 

DTTQLQVANSLMNGVTLSTKLKDGVNFN VDDINFSPVLGCLGSDCN 909 

DNMQLQVASALMQGVTISSRLPDGISGP IDDINFSPLLGCIGSTCAEDGN 863 

DRNTREVFAQVKQMYKTPTLKYFGGFN- FSQILPDPLKPTK 796 

DMELLNFYSSTKPAGFNTPVLSNVSTG EFNI S LLLTNPS SRRK — — 691 



- — RVAGRSAIEDILFSKLVTSGLGTVDADYKKCTKGLS - - 1 ADLACAQ YYNG IMVLPGV 739 

- - -KRKYRSAIEDLLFDKWTSGLGTVDEDYKRCTGGYD- - 1 ADLVCAQYYNG IMVLPGV 1009 

- - - WQKRSVIEDLLFNKWTNGLGTVDEDYKRCSNGRS - - VADLVCAQYYSGVMVLPGV 942 

KVSSRSAIEDLLFSKVKLSDVG-FVEAYNNCTGGAE — IRDLICVQSYNGIKVLPPL 963 

GPSAIRGRSAIEDLLFDKVKLSDVG-FVEAYNNCTGGQE — VRDLLCVQSFNGIKVLPPV 920 
RSFIEDLLFNKVTLADAG-FMKQYGECLGDIN- -ARDLICAQKFNGLTVLPPL 846 

SEQ ID NO: 6072 RSLIEDLLFTSVESVGLP-TNDAYKNCTAGPLGFFKDLACAREYNGLLVLPPI 743 
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SEQ ID NO: 6053 ADAERMAMYTGSLIGGIALGGLTS AVSIPFSLAIQARLNYVALQTDVLQENQKILA 795 

SEQ ID NO: 6057 ANADKMTMYTASLAGGITLGALGGG AVAIPFAVAVQARLNYVALQTDVLNKN'QQILA 1066 

SEQ ID NO: 6061 VDAEKLHMYSASLIGGMALGGITA AAALPFSYAVQARLNYLALQTDVLQRNQQLLA 998 

SEQ ID NO: 6065 LSENQISGYTLAATSASLFPPWSA AAGVPFYLNVQYRINGIGVTMDVLSQNQKLIA 1019 

SEQ ID NO: 60 69 LSESQISGYTTGATAAAMFPPWSA AAGVPFSLSVQYRINGLGVTMNVLSENQKMIA 976 

SEQ ID NO: 6042 LTDDMIAAYTAALVS6TATA6WTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIA 906 

SEQ ID NO: 6072 ITAEMQALYTSSLVASMAFGGITA AGAIPFATQLQARINHLGITQSLLLKNQEKIA 799 

■ • • • • • • 

SEQ ID NO: 6053 ASFNKMTNIVDAFTGVNDAITQTSQALQTVATALNKIQDVVNQQGNSLNHLTSQ 855 

SEQ ID NO: 6057 SAFNQAIGNITQSFGKVNDAIHQTSRGLATVAKALAKVQDVVNIQGQALSHLTVQLQNNF 1126 

SEQ ID NO: 6061 ESFNSAIGNITSAFESVKEAISQTSKGLNTVAHALTKVQEWNSQGSALNQLTVQLQHNF 1058 

SEQ ID NO: 6065 NAFNNALGAIQEGFDATN SALVKIQAWNANAEALNNLLQQLSNRF 1065 

SEQ ID NO: 6069 SAFNNALGAIQDGFDATN— SALGKIQSWNANAEALNNLLNQLSNRF 1022 

SEQ ID NO: 6042 NQFNKAISQIQESLTTTS TALGKLQDWNQNAQALNTLVKQLSSNF 952 

SEQ ID NO: 6072 ASFNKAIGHMQEGFRSTS — LALQQIQDWSKQSAILTETMASLNKNF 845 

***... **..***.* * * 

■ • ■ •■• ■ • • 

SEQ ID NO: 6053 QAISSSIQAIYDRLDTIQADQQVDRLITGRLMLNVFVSHTLTKYTEVRASRQLAQQKVN 915 

SEQ ID NO: 6057 QAI S S S I SDI YNRLDELS ADAQVDRLI TGRLTALNAFVSQTLTRQAEVRASRQLAKDKVN 1186 

SEQ ID NO: 6061 QAISSSIDDIYSRLDILSADVQVDRLITGRLSALNAFVAQTLTKYTEVQASRKLAQQKVN 1118 

SEQ ID NO: 6065 GAISSSLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 1125 

SEQ ID NO: 6069 GAISASLQEILTRLEAVEAKAQIDRLINGRLTALNAYISKQLSDSTLIKVSAAQAIEKVN 1082 

SEQ ID NO: 6042 GAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAA 1012 

SEQ ID NO: 6072 GAISSVIQEIYQQFDAIQANAQVDRLITGRLSSLSVLASAKQAEYIRVSQQRELATQKIN 905 

k k k • • k ••••k,k»kkkkkkk*k • k km 

SEQ ID NO: 6053 ECVKSQSKRYGFCG-NGTHIFSIVNAAPEGLVFLHTVLLPTQYKDVEAWSGLCVDG 970 

•SEQ ID NO: 6057 ECVRSQSQRFGFCG-NGTHLFSLANAAPNGMIFFHTVLLPTAYETVTAWPGICASDG-DR 1244 

SEQ .ID NO: 6061 ECVKSQSQRYGFCGGDGEHIFSLVQAAPQGLLFLHTVLVPGDFVNVLAIAGLCVNG 1174 

SEQ ID NO: 6065 ECVKSQSSRINFCG-NGNHIISLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCIAG 1180 

SEQ ID NO: 6069 ECVKSQTTRINFCG-NGNHILSLVQNAPYGLYFIHFSYVPISFTTANVSPGLCISG 1137 

SEQ ID NO: 6042 ECVLGQSKRVDFCG-KGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPAICHEG 1067 

SEQ ID NO: 6072 ECVKSQSIRYSFCG-NGRHVLTIPQNAPNGIVFIHFSYTPDSFVNVTAIVGFCVKPANAS 964 

kkk k. k kkk ^k . *k *. k # .* 

SEQ ID NO: 6053 TNGYVLRQPNLALYK EGNYYRITSRIMFEPRIPTMADFVQIENCNVTFVNISRS 1024 

SEQ ID NO: 6057 TFGLWKDVQLTLFRN LDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATVS 1299 

SEQ ID NO: 6061 EIALTLREPGLVLFTHELQTYTATEYFVSSRRMFEPRKPTVSDFVQIESCWTYVNLTSD 1234 

SEQ ID NO: 6065 DRGIAPKSGYFVNVNN TWMFTGSGYYYPEPITGNNWVMSTCAVNYTKAPDV 1232 

SEQ ID NO: 6069 DRGLAPKAGYFVQDDG EWKFTGSSYYYPEPITDKNSVIMSSCAVNYTKAPEV 1189 

SEQ ID NO: 6042 -KAYFPREGVFVFNGT SWFITQRNFFSPQIITTDNTFVSGNCDWIGIINNT 1118 

SEQ ID NO: 6072 QYAIVPANGRGIFIQVN GSYYITARDMYMPRAITAGDWTLTSCQANYVSVNKT 1018 

: . : : * . * : * 

SEQ ID NO: 6053 ELQTIVP-EYIDVNKTLQELSYKL-PNYTVPDLWEQYNQTILNLTSEISTLENKSAELN 1082 

SEQ ID NO: 6057 DLPSIIP-DYIDINQTVQDILENFRPNWTVPELTFDIFNATYLNLTGEIDDLEFRSEKLH 1358 

SEQ ID NO: 6061 QLPDVIP-DYIDVNKTLDEILASL-PNRTGPSLPLDVFNATYLNLTGEIADLEQRSESLR 1292 

SEQ ID NO: 6065 MLNISTP-NLPYFKEELDQWFKNQTSVAPDLSLDY — INVTFLDLQDEMN 1279 

SEQ ID NO: 6069 FLNTSIP-NPPDFKEELDKWFKNQTSIAPDLSLDFEKLNVTLLDLTYEMN 1238 

SEQ ID NO: 6042 VYDPLQP-ELDSFKEELDKYFKNHTSPDVDLGDISG-INASVVNIQKEID 1166 

SEQ ID NO: 6072 VITTFVDNDDFDFNDELSKWJNDTKHELPDFDKFN — YTVPILDIDSEID 1066 
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YTVQKLQTLIDNINSTLVDLKWLNRVETYIKWPWi^LCISVVLIFWSMLLLCCCSTGC 1142 
NTTVELAILIDNINNTLWLEWL^ 1418 
NTTEELRSLINNINNTLVDLEWLNRVETYIKWPWWWLIIVIVLIFW 1352 

RLQEAIKVLNQSYINLKDIGTYEYYVKWPWYWLLIGFAGVAMLVLLFFICCCTGC 1335 

RIQDAIKKLNESYINLKEVGTYEMYVKWPWYVWLLIGLAGVAVCVLLFFICCCTGC 1294 

RLNEVAKNLNESLIDLQE 1222 

RIQGVIQGLNDSLIDLEKLSILKTYIKWPWYVWLAIAFATIIFILILGWVFFMTGC 1122 
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FIGURE 7B 



SEQ ID NO: 6054 MFLKLVDDHA-LVVWLLWCWLIV^ 51 

SEQ ID NO: 6062 MLQLVNDNG - LWNVI LWLFVLFF LL IIS ITF VQLVNLC FTCHRLCNS AVY 54 

.SEQ ID NO: 6058. MTFPRALTVIDDNG-MVINIIFWFLLIIILILLSIALLNIIKLCMVCCNLGRTVII 59 

SEQ ID NO: 6045 MYSFVSEETGTLIVNSVLLFLAFWFLLVTLAILTALRLCAYCCNIVNVSLV 52 

SEQ ID NO: 6073 MNLLNKS LEENG - S F LT AL Y 1 I VGFL AL YLLGRALQ AF VQ AADACC LFWYTWW 57 

SEQ ID. NO: 6066 MFMADAYFADTVWYVGQ I IF IVAICLLVI I VWAFLATFKLC IQLCGMCNTLVL 54 
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FIGURE 7C 



SEQ ID NO: 6055 MSNDNC TGDIVTHLKNWNF 19 

SEQ ID NO: 6063 MSNGSIP VDEVIEHLRNWNF 24 

SEQ ID NO: 6059 MKILLILACVIACACGERYCAMKSDTDLSCRNSTASDCESCFNGGDLIWHLANTOF 60 

SEQ ID NO: 6067 MSSVT-TPAPVYTWT ADEAIKFLKEWNF 27 

SEQ ID NO: 6070 MSSTTQAPEPVYQWT ADEAVQFLKEWNF 33 

SEQ ID NO: 6046 -- MADNGTIT VEELKQLLEQWNL 21 

SEQ ID NO: 6074 MPNETNCTLD FEQSVQLFKEYNL 28 



SEQ ID NO: 6055 GWIWILTIFIVILQFGHYKYSRLFYGLKMLVLWLLWPLVLALSIFDTW 78 

SEQ ID NO: 6063 TWIILTILLVVLQYGHYKYSVFLYGVKMAILWILWPLVLALSLFDAWASF 83 

SEQ ID NO: 6059 SWSIILIVFITVLQYGRPQFSWFWGIKMLIMWLLWPWLALTIFNAYSEYQVSRYVMFG 120 

SEQ ID NO: 6067 SLGIILLFITVILQFGYTSRSMFVWIKWILWLMWPLTIILTIFN — CVYALN-NVYLG 84 

SEQ ID NO: 6070 SLGIILLFITIILQFGYTSRSMFIYVVKMIILWLMWPLTIVLCIFN — CVYALN-NVYLG 90 

SEQ ID NO: 6046 VIGFLFLAWIMLLQFAYSNRNRFLYIIKLVFLWLLWPWLACFVLA^ 78 
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SEQ ID NO: 6074 FITAFLLFLTIILQYGYATRSKVIYTLKMIVLWCFWPLNIAVGVIS — CTYPPN-TGGLV 85 

• • • ic "k » k , k . ■ • . 

SEQ ID NO: 6055 FSFFMAVSTLVMWVMYFMSFRLFR 137 

SEQ ID NO: 6063 FSILMACITLMLWIMYFVNSIRLWRRTHSWWSFNPETDALLTTSVM-GR 142 

SEQ ID NO: 6059 FSIAGAIVTFVLWIJyTZFWSIQLYRRTKSWWSFNPETKAILCVSAL-GRSYVLPLEGVPT 179 

SEQ ID NO: 6067 FSIVFTIVAIIMIWFWSIRLFIRTGSWWSFNPETNNLMCIDMK-GRMYVRPIIEDYH 143 

SEQ ID NO: 6070 FSIVFTIVSIVIWIMYFWSIRLF 149 

SEQ ID NO: 6046 IAIAMACIVGLMWLSYFVASFRLFARTRSIOTSFNPETNILLNVPLR-GTIVTRPIjMESE^ 137 

SEQ ID NO: 6074 AAIILTVFACLSFVGYWIQSIRLFKRCRSWWSFNPESNAVGSILLTNGQQCNFAIESVPM 145 

. . . .... ... ... 

SEQ ID NO: 6055 GITVTLLSGVLYVDGHRLASGVQVHNLPEYMTVAVPSTTIIYSRVGRSVNSQNSTG — WV 195 

SEQ ID NO: 6063 GVTLTLLSGTLLVEGYKVATGVQVSQLPNFVTVAKATTTIVYGRVGRSVNASSGTG — WA 200 

SEQ ID NO: 6059 GVTLTLLSGNLYAEGFKIAGGMNIDNLPKYVMVALPSRTIVYTLVGKKLKASSATG — WA 237 

SEQ ID NO: 6067 TLTVTIIRGHLYMQGIKLGTGYSLSDLPAYVTVAKVSHLLTYKRG-FLDKIGDTSG — FA 200 

SEQ ID NO: 6070 TLTATIIRGHLYMQGVKLGTGFSLSDLPAYVTVAKVSHLCTYKRA-FLDKVDGVSG — FA 206 

SEQ ID NO: 6046 VI GAVI IRGHLRMAGHSLGR- CD IKDLPKE ITVATS -RTLSYYKLGASQRV6TDSG- - FA 193 

SEQ ID- NO: 6074 VLSPIIKNGVLYCEGQWLAK-CEPDHLPKDIFVCTPDRRNIYRMVQKYTGDQSGNKKRFA 204 
: : * * * : . ** ■ * * 

SEQ ID NO: 6055. FYVRVKHGDFSAVSSPMSNMTENERLLHFF 225 

SEQ ID NO: 6063 FYVRSKHGDYSAVSNPSAVLTDSEKYLHLV 230 

SEQ ID NO: 6059 YYVKSKAGDYSTEAR-TDNLSEQEKLLHMV 266 

SEQ ID NO: 6067 VYVKSKVGNYRLPSTQKGSGLDTALLRNNI 230 

SEQ ID NO: 6070 VYVKSKVGNYRLPSN-KPSGADTALLR--I 233 

SEQ ID NO: 6046 AYNRYRIGNYKLNTDHAGSNDNIALLVQ — 221 

SEQ ID NO: 6074 TFVYAKQSVDTGELESVATGGSSLYT 230 



FIGURE 7D 



SEQ ID NO: 6056 — MATVKWADASE PQRGRQG 18 

SEQ ID NO: 6064 MASVSF QDRGRK 17 

SEQ ID NO: 6060 MANQGQRVSWGDEST KTRGRSNSRGRKN 31 

SEQ ID NO: 6068 MSFTPGKQSSS-RASSGNRSGNGILK WADQSDQSRNVQTRGRR-AQPKQTATSQQPS 55 

SEQ ID NO: 6071 MSFVPGQENAGGRSSSVNRAGNGILKKTTWADQTERGPNNQNRGRR-NQPKQTATTQ-PN 58 

SEQ ID NO: 6051 MSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGLPN 48 

SEQ ID NO: 6075 MASGKAAGKTDAPAPVIKLGGPKP PKVGSS 35 

SEQ ID NO: 6056 RIPYSLYSPLLVDSEQPW-KVIPRNLVPIMK-DKNKLIGYWN — VQKRFRTRKGK 70 

SEQ ID NO: 6064 RVPLSLYAPLRVTNDKPLSKVLANNAVPTMG-NKDQQIGYWN--EQIRWRMRRGE 70 

SEQ ID NO: 6060 NNIPLSFFNPITLQQGSKFWNLCPRDFVPKGIG-NRDQQIGYWN — RQTRYRMVKGQ 85 

SEQ ID NO: 6068 GGNWPYYSWFSGITQFQKGKEFEFAEGQGVPIAPGVPATEAKGYWYRHNRRSFKTADGN 115 

SEQ ID NO: 6071 SGSWPHYSWFSGITQFQKGKEFQFAEGQGVPIANGIPASEQKGYWYRHNRRSFKTPDGQ 118 

SEQ ID NO: 6051 NT ASWFTALTQHGK-EELRFPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGK 101 

SEQ ID NO: 6075 GNASWFQAIKAKKLNTPPPKFEGSGVPDNENIKPSQQHGYWR — RQARFKPGKGG 88 

* • ** **...* 

• • • . • • . 

SEQ ID NO: 6056 RVDLSPKLHFYYLGTGPHKDAKFRERVEGVVWVAVDGAKTEPTGYGVRRKNSEPEIPHFN 130 

SEQ ID NO: 6064 RIEQPSNWHFYYLGTGPHGDLRYRTRTEGVFWVAKEGAKTEPTNLGVRKASEKPIIPKFS 130 

SEQ ID NO: 6060 RKELPERWFFYYLGTGPHADAKFKDKLDGVVWAKDGAMNKPTTLGSRGANNESKALKFD 145 

SEQ ID NO: 6068 QRQLLPRWYFYYLGTGPHAKDQYGTDIDGVYWVASNQADVNTPADILDRDPSSD— EAIP 173 
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QKQLLPRWYFYYLGTGPHAGASYGDSIEGVFWVANSQADTNTRSDIVERDPSSH — EAIP 


176 


SEQ 


ID 


NO: 6051 


MKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNN — AATV 


159 


SEQ 


ID 


NO: 


6075 


RKPVPDAWYFYYTGTGPAADLNWGDTQDGIVWVAAKGADTKSRSNQGTRDPDKF — DQYP 


146 










*** **** . .*. *** _ * 


• • • 




SEQ 


ID 


NO: 


6056 


QKLPNGVTWEE PDSRAPSRSQSR 


— SQSRGRGESK 


164 


SEQ 


ID 


NO: 


6064 


QQLPSWEIVEPNTPPASRMSRSRSRGNGNNRSRSPSNNRGNNQSRGNSQNRGNNQGRG 


190 


SEQ 


ID 


NO: 


6060 


GKVPGEFQLEVN QSRDNSRSRSQ 


SRSRSRNR 


176 


SEQ 


ID 


NO: 


6068 


TRFPPGTVLPQGYYIEGS -GRSAPNSR 


— STSRASSRASS 


210 


SEQ 


ID 


NO: 


6071 


TRFAPGTVLPQGFYVEGS-GRSAPASR 


— SGSRSQSRGP 


212 


SEQ 


ID 


NO: 6051 


LQLPQGTTLPKGFYAEGSRGGSQASSR- — 


— SSSRSRGNSR 


196 


SEQ 


ID 


NO: 


6075 


LRFSDGGPDGNFRWDFIPLNRGRSGRS 


— TAASSAAASR 


183 



SEQ 


ID 


NO: 


6056 


PQSRNPSSDRNHN 


SQDDIMKAVAAALKSLGFDKPQEKDKKS 


205 


SEQ 


ID 


NO: 


6064 


ASQNRGGNNNNNNKSRNQSNNRNQSNDRGGVTSRDDLVAAVKDALKSLGIGENPDRHKQ- 


249 


SEQ 


ID 


NO: 


6060 


SQSRGRQQFNNKK 


DDS VEQAVLAALKKLGVDTEKQQQRS - 


215 


SEQ 


ID 


NO: 


6068 


AGSRSRANSGNRT 


PTSGVTPDMADQIASLVLAKLGKDAAKP 


251 


SEQ 


ID 


NO: 


6071 


NNRARSSSNQRQ 


PASTVKPDMAEEIAALVLAKLGKDAGQP 


252 


SEQ 


ID 


NO: 6051 


NSTPGSSRGNS 


PARMASGGGETALALLLLDRLNQLESKV 


235 


SEQ 


ID 


NO: 


6075 


APSREGSRGRR 


SDSGDDLIARAAKIIQDQ 


212 



SEQ ID 
SEQ ID 
SEQ ID 
SEQ ID 
SEQ ID 
SEQ ID 
SEQ ID 



NO: 6056 
NO:6064 
NO:6060 
NO:6068 
NO: 6071 
NO:6051 
NO:6075 



AKTGTPKPSRNQSPASSQTSAKSLARSQSSETKEQKHEMQKPRWKRQPNDDVTSNVTQCF 265 



— -QQKPKQEKSDNSGKNTPK- 

RSKSKERSNSKTRDTTPK- 

— -QQVTKQTAKE IRQKILN- 
— — KQVTKQSAKEVRQKILN- 
SGKGQQQQGQTVTKKSAAEASK- 
QKKGSRITKAKADEMAH- 



-KNKSRATSKERDLKDIPEWRRIPKG — ENSVAACF 300 

NENKHTWKRTAGK GDVTRFY 253 

KPRQKRSPNK — QCTVQQCF 286 

KPRQKRTPNK — QCPVQQCF 287 

-KPRQKRTATK — QYNVTQAF 275 

RRYCKRT I P P — NYRVDQVF 247 



SEQ ID NO: 6056 GPRDLDH— NFGSAGWANGVKAKGYPQFAELVPSTAAMLFDSHIVSKESG 314 

SEQ ID NO: 6064 GPRGGFK NFGDAEFVEKGVDASGYAQIASLAPNVAALLFGGNVAVRELA : 349 

SEQ ID NO: 6060 GARSSSA NFGDTDLVANGSSAKHYPQLAECVPSVSSILFGSYWTSKEDG 302 

SEQ ID NO: 6068 GKRGPNQ NFGGGEMLKLGTSDPQFPILAELAPTAGAFFFGSRLELAKVQNLSGNLDE 343 

SEQ ID NO: 6071 GKRGPNQ NFGGSEMLKLGTSDPQFPILAELAPTVGAFFFGSKLELVKKN — SGGADE 342 

SEQ ID NO: 6051 GRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTP 327 

SEQ ID NO: 6075 GPRTKGK-EGNFGDDKMNEEGIKDGRVTAMLNLVPSSHACLFGSRVTPKLQL 298 

•k -k -k -k -k -k . 



SEQ 


ID 


NO: 


6056 


SEQ 


ID 


NO: 


6064 


SEQ 


ID 


NO: 


6060 


SEQ 


ID 


NO: 


6068 


SEQ 


ID 


NO: 


6071 


SEQ 


ID 


NO:6051 


SEQ 


ID 


NO: 


6075 



- - -NTWLTFTTRVTVPKDHP HLGKFLEELNAFTR EMQ 349 

DS YE IT YNYKMTVPKS DP NVELLVSQVDAFKTGNAK- LQRKKEKKNKRETTLQ 401 

DQIEVTFTHKYHLPKDDP KTGQFLQQINAYAR PSEVAKEQR 3 43 

PQKDVYELRYNGAIRFDSTLSGFETIMKVLNENLNAYQQQD GTMNMSPKPQRQ — R 397 

PTKDVYELQYSGAVRFDSTLPGFETIMKVLNENLNAYQKDG GADWSPKPQRKGRR 398 

- —SGTWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFP PTE- - 368 

DGLHLRFEFTTWPCDDPQFDNYVKICDQCVDGVGTRPKDDEPKPKSRSSSRPATRG 355 



SEQ 


ID 


NO: 


6056 


SEQ 


ID 


NO: 


6064 


SEQ 


ID 


NO: 


6060 


SEQ 


ID 


NO: 


6068 


SEQ 


ID 


NO: 


6071 



QHPLLNPSALEFNPSQTSP ATAEPVRDEVS IETDI IDEVN 389 

QHEEAI YDDVGAPSDVTHANLEWDTAVDGGDTAVE I INE IFDTGN- — 446 

KRKSRS KS AERS EQDWPDALI ENYTDVFDDTQVE I IDEVTN 3 85 

GQKNGQGENDNISVAAPKSRVQQNKIRELTAEDISLLKKMDEP FTEDTSEI 448 

QAQEKKDEVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGWPDGLEDDSNV- — 4 5 4 
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SEQ ID NO: 6051 - PKKDKKKKTDE AQPLPQRQKKQPTVTLLPAA- — 399 

SEQ ID NO: 6075 NSPAPRQQRPKKEKKLKKQDDEADKALTSDEEI^NAQLEFYDEPKVIIMGDAALGENEL 414 
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FIGURE 7G 
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FIGURE 8 



_ . — _ Section 1 

(1) 1 J0_ go J30 42 

avian IBV partial 5'UTR 161- (1) TATTAAAATCTTATTGTTGCTGGTATCACTGCTTGTTTTGCC 



HCoV-OC43 5'UTR (1) 
bovine CV 5'UTR (1) 
Consensus (1) 



— - ■* — ■ — ■ — — Section 2 

(43) 43 ffQ jBQ^ ; 70 _ 84 

avian IBVpartiai 5'UTR 161- (43) GTGTCTCACTtItataCAT^^ 

HCoV-OC43 5'UTR (1) — - 

bovine CV 5'UTR (1) 

Consensus (43) 

. — - . — — . — _ — , , — _ . — Section 3 



(85) 85_ _ J90 ._JO0 _ J10_ _ . 126 

avian IBV partial 5'UTR 161- (85) ~C AGC G TC C TACGGGCGTC G T GGCTGg'tTC GAG T G C G AG GAAC 

HCOV-OC43 5'UTR (1) • ■ - ■ 

bovine CV 5'UTR (1) . 

Consensus (85) 

— — ■ — — ■ — — - - — ■■ ■- — — — Section 4 

(127) 127 ,140 _ ,150 168 

avian IBVpartiai 5'UTR 161- (127) C T CTGGTT cCdfc TAGCi \G i\\GGC^G]^ST^i3^G^AG^[G^AC 

HCoV~OC43 5'UTR (1) • GAT T G T G A 0 CG A TT TGCGTG CG T GCA — TC C CG C 

bovine CV 5'UTR ( 1 ) GAT TGCG A'G CG'AT TTGCGT G C G T GCA- -X CC'CG C 

Consensus (127) GATTG GAGC G AT T T G C G 5 G C G TG C A TCCCGC 
— — . — _ . ....... .• . . „ .. Section 5 



(1 69) 169 180 1 90 _ _200 210 

avian IBV partial 5'UTR 161- (169) T T G A G A CGT A C ,C G G t"t C TG T T G t" GTGAAAJ A ~ - ~C G G G g'^F^AC 

HCoV~OC43 5'UTR (33) TT'CA JC TG ATC XC TT. GT T A GAT C i TT TTGT A A XO'TA 

bovine CV 5'UTR (33) XTCA — — j^TS A T C T'C T T GT T A G A T G T T T T OA TA A T C T A 

Consensus (169) ttca ctgatctcttgttagatcttttcg 

~— — , __ , — _ . ,„ . Section 6 



(211) 211 220 £30 £40 . 252 

avian IBV partial 5'UTR 161- (209) C T CC C C C C^C^TA CCTC T AA G G G CTTTTGAGCC T AG C G T T G G 
HCoV-OC43 5'UTR (68) AACT TT ATAAA'AAC ATCC ACTCC CT GTAATCTAT GCTTGTG G 
bovine CV 5'UTR (68) A AC T T T A T AAA A A C A T C C A CTCC O TGT AG TOT A T G C C T G T G G 
Consensus (21 1) AACT T TAT AAA A AC ATCC AC T C C C TGT AG TC T A TG C C T G TGG 

■ — — — - - — — ■ — • — — -.-Section 7 

(253) 253 .260 £70 280 294 

avian IBVpartiai 5'UTR 161- (251) I^TAC G T T : CT CG C AT AAGGT c'gg:ct A? AC G AGGT TT g'SS^GG 
HCOV-OC43 5'UTR (110) jSCGT AG AT-TT TTC^tP^GTGGTGTTTATAa! T - CAT T TO 4- GOT 
bovine CV 5'UTR (110) GCGT AG A XT TTT C AT AG T GGTGTC T A T AT T - b*AT TTCT ~GCT 
Consensus (253) G C G T AG A TTT T T C ATAG T G G T G T C TAT AT T CAT T TOT GCT 



Section 8 



(295) 295_. J300 5 310 320 336 

avian IBV partial 5'UTR 161- (293) piGj^ 

HCoV-OC43 5'UTR (150) G T T AA C AGC T T T-C A : G CCAGG jG AC GT GT T GT A T CC TA G G G 

bovine CV 5'UTR (150) GTTAACAGCTTYCAGCCkGGGACgTGXTGTATCCTApGC 

Consensus (295) G T T A AC AGC T T T C AG C C AG G G AC GTGTT G TATCC TAG G C 
— — — — - — Section 9 



(337) 337 ,350 360 373 



SEQ ID NO: 9910 -(335) TTT ^ G T SAjG C|A G iACjA T a!c A| A T A Gj^C AjG jx Gj A C A A C A T G 

SEQIDNO: 9919 :(189) ^GXd~&cjc^ - - 

SEQ ID NO: 9892 :(189) !^X&-jGticCA£^ 

consensus (337) AGTG GCC C AC C CAT AG G TCACAATG 
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FIGURE 9 



F1:AT ^ G l G c^ A ^ G f G ™ 



SEQ ID NO: 

(136-154 nt) 6021 



F2: GTG^GTG^AT^-C^CTTCA 

O C CC G 



(152-172 nt) 



6022 



F3: CTTCAC^G^TCT%TGT|jGA 
T A C TA 



(168-195nt) 



6023 



R1 . AGA ACCTGT C AC ^ TC ^g^ G XT G (307-329nt) 
R2: AAAf G^fTAT A cf^C f^T ATG (265-288nt) 



*GA„ CT 



AA AC AC 



R3: °%5° ac"TATG Af C-^GCCCA (250-274nt) 



6024 
6025 
6026 
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FIGURE 10 



avian IBV 3'UTR (NCJ)01451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



avian IBV 3'UTR (NCJD01461) 271 03- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



— _ Section 1 

(1) J 10 go 36 

(1) GTAACATAATGGACCTGTTGTTTCCTGGTACATTTT 
(1) 

(1) 

(D 

. _ — r Section 2 

(37) 37 ,50 ,60 72 

(37) GTTAAACACTATTTCTG TG C T T TCCT AT C A AT TAT T 

(1) 

(1) 

(37) 



avian IBV 3'UTR (NCJ3Q1451) 27103- 
HCoV-OG43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



avian IBV 3'UTR (NCJ301451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



(73) 
(73) 
(1) 
(D 
(73) 

(109} 
(109) 
(1) 
<D 
(109) 



73 



,80 



SO 



Section 3 
108 



ACAGGCATTGATTGTGATTATGTTCAATACTTAAGC 



,120 



.130 



Section 4 
144 



109 _ 

TTCTTTTGGTTGCTTTTTGCTTATTGTATTGTTGCT 



avian IBV 3'UTR (NCJDG1451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



(145) 
(145) 
(1) 
(1) 
(145) 



145 



150 



160 



^ Section 5 

170 180 



GTGCTTTTTATTGTTGTGATTCTCATTSgTTJpl--^ 



»TA'AGAGAA ^3;A»Sj 



181 



,190 



200 



A GAGAATGAAC 

__ Section 6 

216 



avian IBV 3'UTR (NC_0Q1451) 27103- 
HCoV-OC43 3*UTR partial 
bovine CV 3*UTR 
Consensus 



(181) _ ______ ______ _ _ _ _ 

(1 79) TT T A ^CG T Aj^A A^jT T C AA^A-GC^MVG A~G tSIa AaG^jA^T 

(11) 
(181) 



avian IBV 3'UTR (NCJ)01451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



v cttat gtcggcacctggtggtaacccctc GCAGG 

- ■■ ~ • ■ - ■ Section 7 

217 230„ ,„ . _ _ g40_ 

gT^G^ATpT AG C T T G A TTA'CCTAC A TG T C%S?gjG C 



(217) 
(215) 

(48) A A A G T G'&GG 1 A TAAG G C A C 

(45) AA&G Tt G Q<4 AT A A GGC A C 

(2 1 7) AAAGTCG G G 



-TCTCJATCAG 



ATAAGGCAC TCTCTATCAG 
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FIGURE 10 (contd.) 



_ , — _ — — — — ■ - — Section 8 

(253) 253 _ ^ £60 _ 270 _ 288 

avian IBV 3'UTR (NCJ301451) 27103- (251) C AG G GAAA TGTCT AAT CTGT Cp^C^T AG'Sft^C C T gSH 

HCoV~OC43 3'UTR partial (76) A A T G G A — TGTCTTGCTGCT A TXAjT*A G AT A G A, jg 

bovine CV 3'UTR (73) AATGGA — TGTCTTGCTGCT A T A A T-A G A T A G A §g 

Consensus (253) AAT GG A TGTCT T GCT GC TATA ATA G A TAG A G 

_ — ___ . „ _ Section 9 

(289) 289 300 ,310 324 

avian IBV 3'UTR (NC_001451) 27103- (287) J^CGA^CGfST^GACCCTTAGAf TTTAATTTAGTTT 

HCoV-OC43 3'UTR partial (107) AAGGTTATAGCAGACTAT- AGATT.' AATTAGTTG 

bovine CV 3'UTR (1 04) A AG G T T A T A G C AG ACT AT - AG AT % AA«T T A G T T G 

Consensus (289) AAGGTTATAGCAGACTA¥ AGATT AATTAGTTG 
_ _ _ „ . _ Section 10 

(325) 325 ,330 340 350 360 

avian IBV 3'UTR (NCJ301451) 27103- (323) A JpT T^lA^ST^Ap^T T AAG TK^pfl^SAgppT^Gp 
HCoV-OC43 3'UTR partial (139) ^AAGTTT;TGTGTggtaatgiaatagt 1 gttggagaaa ? G 
bovine CV 3'UTR (136) A. A A G T T T\T GTGTG G T A A T G T A T A G T;G T T GO A C AAA G 
Consensus (325) AAAGTTT T GTGTG G T A A T G T AT AG T gFtGGAGAAAG 

- — ■ ^ • — Section 1 1 

(361) 361 370 380 396 

avian IBV 3'UTR (NCO01451) 27103- (358) iAT^^STGCCAG^ggc^GGfecc^C-tdGGlGT^C 

HCOV-OC43 3'UTR partial (175) fe'G - A A A G A0 : T' TGC G G AAGiTA AiT T GGCGA.CA&G 

bovine CV 3'UTR (172) fe-AAAGACT TGCGGAAGTAA'TTCCCGACAAG 

Consensus (361) TG A A AG ACT T GC G G AAG T A AT T G CC G AC AAG 

■ : — - — — Section 12 

(397} 397 410 420 . j432 

avian IBV 3'UTR (NCJ>01451) 27103- (393) GATiGlG^|TAC^CApTif Gil®CCCA#TlGgGG^ 

HCOV-OC43 3'UTR partial (206) TGCCC AAA'GGGAAGAGCCAGCACQ .TTAAGITT^ 

bovine CV 3'UTR (203) TGCCCAAGGGGAAGAGCCAGCATqj [TTAAG^^ 

Consensus (397) T G C C C A A GG G G A AG A G C C AG C A C G TTAAGTTA 

— — ■ ■ Section 13 

(433) 433 ,440 450 468 

avian IBV 3'UTR (NC_001451) 27103- (429) AGpG^T^AAT TSJl? AC3 Tj — T T:AA^G^;T|A:I^G^-!E T^A^T - T 
HCoV-OC43 3'UTR partial (238) CCACCCAGTAATTAGtfAAATGAATGAAGT.T AATT.AtI 
bovine CV 3'UTR (235) C 0 & T C C A G T AAT T A G T A A A TG A A T G A A G T T A A T T A T 
Consensus (433) CCA C C A G T AAT T A G T A A AT G A A T G A AGT T A ATT A T 

. — „ _ — . - - - . ,. - _ Section 14 

(469) 469 480 490 _ 504 

avian IBV 3'UTR (NCJ3Q1451) 27103- (462) fee GTAAG r ATA'GTTSAAlTTTATAGGCTAGTATAGA 

HCOV-OC43 3'UTR partial (274) G G C G A A T T G G A A G A AT.CjAR 

bovine CV 3'UTR (271) p&dc!%A*T^ 



(505) 505 _ 513 

avian IBV 3'UTR (NC_001451) 27103- (498) GTTAGAGCA SEQ ID NO: 9911 

HCOV-OC43 3'UTR partial (293) — SEQ ID NO: 9920 

bovine CV 3'UTR (290) SEQ ID NO: 9893 

Consensus (505) 
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FIGURE 11 



F-1 TCTATC-f^A — GGATGTCT 

AGA T 



(245 ~ 265 nt) 



SEQ ID NO: 

6027 



F-2 TTAGTT J AA— TTT £ GT^ T^GT 
G AG T G G 



(318-339 nt) 



6028 



F-STAGTGTT^GAG-^A^GT^TAAAGA (346-368 nt) 



r-1 a^tt|gccata^t^aactt 



(458 ~ 476 nt) 



6029 
6030 



R-2 ACTAA^T^G ^T-^T^C — TAA 
AATT A T CT CC C 



R -3 T ^TC^GC^T-^C^C-^GCA 
AC C G GG CC G 



( 426 ~ 448 nt) 



( 375 ~ 395 nt) 



6031 

6032 
6033 



FIGURE 12 



Coils outpui for unknown 
, 1 



i 



window 14 
wimfow=£l 
wiiadpw=£8 



£00 406 600 800 



1300 1400 
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LearnCoil score using viral membrane-fusion -table for pep4 
-i r 
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FIGURE 13 
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FIGURE f 4 



1b 
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FIGURE 15 



Catalytic Cys 



SEQ ID NO: 6569 




Catalytic His 




1 sgfrkrnafps gkvegcmvqy bcwtttlngl i<rlddtvycpr Ihj^ictaeditil 

61 ksnhsf Ivqa gnvqlrvigh srtK$lncllr lk vdtsnpktpk ykfvriqpgq 

121 spsgvyqcam rpnhtikgqfjlngscl^svgf nidydcvsf c [^^kmelptg 

181 fygpfvdrqt aqaagtdt-t Ti^g^nvlaTjorlya avingdrTfff Ji*m?ft:tt Indf 

2 41 pltqdhvdil gplsaqtgia vlc&ta^aalke llqnginng^ /ilgstilede 
301 sgvtf q 




npnyedllir 
tfsvlacyng 
vhagtdlegk 
nlvamkynye 
ftpfdwrqc 



Key residues of the substrate site 
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avian IBV mpZ 
MHV mp2 
SARSnsp2 
BCoV nsp2 
Consensus 



{1} 
(1) 

m 
m 
m 
m 



scpnawspf 



1S;A¥ EKCIVgvPYR<3Srai*tiaLWW3DP|^ CPU 
ISK^EpCXVS 



<S2) 

avian IBV nspS (49) 
MHVaspt m$ 
SARBmp2 (52J 
BCoVnapa (52) 
Consensus {52} 



attian ; IBVmp2 000) 

SARS- fi$p2 (102) 
BCoV nsjp2<102) 
-Gor^naua(1G3) 



(134) 

avian IBV mp2 (1S1) 
MHV rtsp2 (103) 
SARS nsp2(153) 
SCo¥nap2|153> 
Consensus (154) 



— — — - — — - — — — , — — Seotion 1 

I ,10 2 0^ ,30 _ SL 51 

|V$S ~KFQG 

VI Cj||&E. DHUff 

SGXVKMVSFS &KVEPC I VSVTyGHMTL^ QIiWLDDTV YC PRBVXC 3 A&PHTtf 
— — — — — — ..— — - — — — — , — Ssclion 2 

52 7 0 BO _ . SO tQ2 

'\LQT UA E T P K Y 

LTV MS Y QMQ^CBL¥IxT¥^ tftMS pT P K Y 
PDY ifiiCEtrisSDF VLSGR 'VSLTVHSYQ^^ 

— — — — : — Section 3 

103 ; ,1 10 _ ,120 . J3Q 14 0 „IP 

:^PVRjj0f» a^fp^liltC tpelf ; S 0¥§3C AM Mil? £^^LNG s * 
TFOVVKPG F 'I* r I'^L^A * 1 4 G K y O G Jkffff V TM B' 1 * $ J C * J £fa 

■ ■ h. . : — -, - ■ : — — — — — — Ssclion 4 

204 



svcirvu 



154. 



,iao 



(20SJ 

avian !B¥ nsp2 (202) 
MHV nsp2 (204) 
BARS nsp2 (204) 
BC0Vosp2:f2O41 
Consensus (20B) 



■MGEMg VgPli^iy^'B tfl^iaeMipT 0 ^HOgFf Q#i:i£ikO¥8C|ii^& O Y I M< 
DGDCVkVvYK^^ aDPYGPYVDAQWQLPVQDYTQTVKf 

— _ — Section 6 

2 0S 210 220 .230 240 2S5 

vv.^/it porch - - - - - -vp l*:?c u- ac j vv: * m e^^m% : ^y£n>~-~& 

^WLx ^1.1B|©DR — ^ WF^RF¥%^R©> tll^ ^>MK ¥ 13 V u' F L fQUHVU 
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FIGURE 17 
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FIGURE 19 
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FIGURE 20 
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FIGURE 21 

FIGURE 21 A 

G1 cons 



AIBV 




100 } — - G2 cons 
SARS 





WO 2004/092360 



83/193 



PCT/US2004/011710 



FIGURE 23 

LPRKSQPTSISCRSVLi-TNFKICVAVARLHA-CTYAV-TIINFTVVDKKRVTRPSSADCL 
RFRPCCSRSSAYLGFVRV-PKGKMESLVLGWEKTHVQLSLiPVLrQVRDVLVRGFGDSVEE 
ALSEAREHLKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKWELVAEMDGIQY 
GRSGITLGVLVPHVGETPIAYRWLLRKN-GNKGAGGHSYGIDLKSYDLGDELGTDPIEDY 
EQNWNTKHGSGALRELTRELN^ 

QLDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPL 
NSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECKbMHLSTLMKCN 

KATCEHCGTENIiVIEGPTTCGYLPTNAWKMPCPACQDPEIGPEHSVADYHNHSNIETRLi 
RKGGRTRCFGGCVFAWGCYNKRAYWPRASADIGSGHTGITGDWETLNEDLLEILSRE 
RVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGNYKVTKGKPV 
KGAWNIGQQRSVLTPIiCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQS 
LRLVDAMVYT S DLLTNS VI IMAYVTGGLVQQT S QWL SNLLGTTVEKLRP I FEWIEAKL S A 
GVEFLKDAWE I LKFL I TGVFD I VKGQI QVASDNI KDC VKCF I DWNKALEMC IDQVT I AG 
AKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKE^ 

LKNGELEALETPVDSFTNGAIVGTPVCWGLMLLEIKDKEQYCALSPGLLAT3MVFRLKG 
GAP IKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKC S VYTVE SGTEVTEFAC WA 
EAWKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAE 
CEEEEIDETCEHEYGTEDDYQGIjPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEP 
T PEE PWQFTGYLKLTDNVAIKCVD I VKEAQ S AN PMVI VNAAN I HLKHGGGVAGALNKAT 
NGAMQKESDDYIKLNGPLTVGGSCLLSGHNLAKKCLHW 

SQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAWDKALYEQWMDYLDNLKPRVE 
APKQEEPPNTEDSKTEEKSWQKPVDVKPKIKACIDEVTTTLiEETKFLTNKLLLFADING 
KLYHDSQTOLRGEDMSFLEKDAPYMVGDVITSGDITCWIPSKKAGGTTEMLSRALKKVP 
VDEYITTYPGQGCAGYTLEEAKTALKKCKSAFWLPSEAPNAKEEILGTVSWNLREMLAH 
AEETRKLMP I CMDVRAIMATI QRKYKGIKI QEG IVDYGVRFFFYTSKEPVAS I ITKLNSLr 
NEPLVTMPIGYVTHGFNLEEAARCMRSLKAPAWSVSSPDAVTTYNGYLTSSSKTSEEHF 
VETVSLAGSYRDWSYSGQRTELGVEFLKRGDKIWHTLESPVEFHLDGEVLSIiDKLKSLL 
SLREVKTIKVFTTVDNTN^ 

LPSDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSI^ 

LALQQLEVKFNAPALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHAN 
LESAKRVIj]>T^CKHCGQKTTTLTGVEAVMYMGTLS YDNLKTGVS I PCVCGRDATQYLVQQ 
ESSFVMMSAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETIjYRIDGAHIjTKMSEYK 
GP VTDVF YKET S YTTT I KPVS YKLDGVT YTE I E PKLDGY YKKDNAY YTEQ P I DLVPTQ PL 
PNASFDNFKLTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDWAIDYRHYSASFK 
KGAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNLA 
CESQQPTSEEWENPTIQKEVIECDVKTTEWGNVILKPSDEGVKVTQELGHEDLMAAYV 
ENTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLiGQAAITTSNCAKRL, 
AQRVFNNYMPWFTLLFQLCTFTKST^ 

PKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYC^^ 

FCEGSFPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGLAAEWVLAYMLFTKFFYL 

LGLSAIMQVFFGYFASHFISNSWLMWFIISIVQMAPVSAWRMYIFFASFYYIWKSW 

MDGCTSSTCMMCYKRNRATRVECTTIVNG 

GSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHLYFDKAGQKTYERHPLSH 
FWLDNLRAMTKGSLPINVIVFDGKSKCDESASKSASVYYSQLMCQPILLLDQALVSDV 
GD S TE VS VKIMEFDAYVDTF S ATF S VPMEKLKALVATAHS EL AKGVALDGV S TFVS AARQG 
WDTDVDTKDVIECLKLSHHSDLEVTC^ 
AQVAKSHWSLIWIWKDYMSLSEQLRKQIRSA^ 

KGGKI VSTCFKLMLKATLLCVL AALVC Y I VMPVHTLS I HDG YTNE 1 1 G YKAI QDGVTRD I 
ISTDDCFANKHAGFDAWFSQRGGSYKNDKSCPWAAIITREIGFIVPGLPGTVLRAINGD 
FLHFLPRVFSAVGNICYTPSKLIEYSDFATSACVLAAECTIFKDAMGKPVPYCYDTNLLE 
G S I S YS ELRPDTR YVLMDG S 1 1 QF PNT YLEG SVRWTTFDAE YCRHGTC ERS E VG I C L S T 
SGRWLNNEHYRALSGVFCGVD^ 

AYYFMKFRRVFGEYNHWAANALLFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDV 
SFLAHLQWFAMFSPIVPFWITAIYVFCISLKHCHWFF3SnNTYLRKRVMFNGV 
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CTFLLNKEMYLKLRSETLLPLTQYNRYLALYNKYKYFSGAL^ 

F SNS GADVL YQ P P QT S I T S AVL Q S GFRKMAF P S GKVEGCMVQ VTC GTTT LNGLWLDDTVY 
CPRWICTAEDMIiNPNYEDLLIRKSNHSFLVQAGWQLRVIGHSMQNCLLRLKVDTSNPK 
TPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCV 
SFCYMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLWLAWLYAAVINGDR 
WFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVL 

GRTILGSTILEDEFTPFDWRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSIjLILVQSTQW 

SLFFFWENAFLPFTLGIMIAACMLLVKHKHAFLCLFLLPSLATVAYF1WYMPASWV 

MRIMTWLELADTSLSGYRLKDCWASALVLLILMTARTVYDDAMRWTLM^ 

VYYGNALDQAI SMWALVI SVTSNYSGWTTIMFLARAIVFVCVEYYPLLFITGNTLQC IM 

LVYCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDAFKL 

NIKLLGIGGKPCIKVATVQSKMSDVK^ 

AKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEML^ 

QEAYEQAVANGDSEWLKKLKKSLWAKSEFDRDAA^ 

KRAKVTSAMQTMLFTMLRKLDNDALNNI I NNARDGC VP LN 1 1 PLTTAAKLMVWPDYGTY 
KNTCDGNTFTYASALWE I QQ WDADSKI VQL S E INMDNS PNLAWPL IVTALRANSAVKLQ 
NNEL S PVALRQMS C AAGTTQTACTDDNALAYYNNSKGGRF VLALLSDHQDLKWARFPKSD 
GTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLI^ 
PANSTVLSFCAFAVDPAKAYKDYLASGGQ 

FGGASCCLYCRCHIDHPNPKGFCDLKGKWQIPTTCANDPVGFTLRNTVCTVCGMWKGYG 
C SCDQLREPLMQS ADASTFLNGFAV- VQPVIiHRAAQALVLMS STGLL IFTTKKLiLVLQS S 
- KL I AVAS RRRMRKA I Y - TLTL - LRG I LC LTTNMKRLF I TWLKIVQRLLSMTFSSLE-MV 
TWYHXYHVSV-LNTQWLI-SMLYVILMRVIVIH-K^ 

LT S YAYMLT - VS VYANH Y - RL YNS AMLC VMQ AL - AY - H - 1 1 R I LMGTGT I SVI SYK-HQA 
AEFLLIWIHITHC-CPSSL-LGHWLLSPIWMLI^ 

IGTRHTIPIVLTVWMIGVSFIVQTLMCYFLLCFHLiQVLiDH EKYL-MVFLLLFQLDTIF 

VS-ESYIIRM-TYIARVSVSRNF-CMDLIQLCMQLLAIYC-INALHAFQ-DH-QTMLLFK 

LSNPVILIKTFMTLLCLKVSLRKEVLLN-NT^ 

SYS-LKLLINTLIVTMVAVLMPTK-SLT^ 

RILSWSSLL-LK-ILSMPLVQRIELAP-LVSLSWL-QIDSFIRNY-SQ-PPLEELLW- 

LEQASFTVAGIIC-KLFTVM-KLHTLWVGIIQNVTEPCLTCLG-WPLLFLLANITLAVTY 

HT VS TG - LT S VRKY - VRWS C VAAH YMLNQ VEHH P VMLQLLML I VS LTF VKLLQPM-MHFF 

QLjMVIR-LTSMSAIYNTGSMSVSIEIGMLIMNSWMSFTLTCVNISP — FFLMMPLCAITV 

TMRLKV--LALRTLRQFFIIKIMCSCLRQWGLRLTLLKDL^^^ 

CLTQIHQEY-AQAVLSMILSKQMVHL-LKGSCHWLLMLT^ 

E S YMMS LL ATC W 

VPVLGDHSYVASAAMTMSFQHHTN-CCLLIPMFAMPQVVMSL 

LPLVFHWLMVRFLWTKTHV-AVTMSLTSMR-QHVIGLMLAITYLPTLVLRDSSFSQQK 
RSKPLRKHLSCHWLPLYAKYSDTENCIFHGRLENLDHH-TETMSLLVTV-LKIVKYRLiE 
STPLKKVTMVMLLCTEVLRHTS-MLVIT^ 

TSQMSFLAMLQIIKRSACKSTLHSKDHLVLVRVILPSDLLSIT^ 
WKRH-NICP-INWESYLRVRA-SVLINSK-IQH^ 

LWLL IMT - VLSMLDFVQNTTS I LAI LLNYQPPAHC -LKAH-NQNI L IQC ADL -KQ-VQTC 
SLELVAWLLKLLTL-VL-FMTIS-KHTRISQLNASKCSTKVLLHMMFHLQSTDLK-AL- 
ENFLHAILLGEKLFLSHLIIHRTL-LQKS-DCLRRLLIHHRVLNMTMSYSHKLLKQHTLV 
MS TASMWL S QGQKLAF C A- C L I E I FMTNCNLQV- KYHVAMWLH YKQKM- LDFLRTWRS L 
LVFILHRHLHTSALI-SSRLKDYVLTYQAYQRT-PTVDSSL-WVSK-ITKSMVTLICLSP 
AKKLFVTFVRGLALM-RAVMQLEMLWLTYLSS-DFLQVLT — LYRLVMLTLKITQNSPE 
LMQNLHQVTSLNILYHSCIKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ- 
S TL SRLDLKERWC VT NVQLAFLLHQ I LMPAG 1 1 LWVLTMS I THL - LMF S SGALRVTFRV 
TMTN I ARYMEMHMWLWML S - LDV- Q SMS ALL S AL I GLLNTLL - EMN- GL I LLAEKYNTW 
L-SLHCLLISFQFFMTLEIQRLSSVCLRLK-NGSSTMLSHWTKLTK-RJNTSSILMLHITI 
NSLWFVCFGIVTLIVTQPMQLCVGLTQESCQT-TYQAVMVWCM-ISMHSTLQLSIKVH 
LLI-SNCLSFTILIVLVSLMANK-CRILIMFHSNLLRVLHDAI-WLFADTMQMSTDSTW 

MHII FLLDLAYGFTNNLILITCGIHLPGYRV-KMWLIMLLIKDTLMDTPAKHLFPSLI 

MLFTQR-MVLMWRSLKIRQHFLLMLHLSFGLSVTLNQCQRLRYSIIWVLISLLIL-SGTT 
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KEKPQHMYLQ-VSAQ-LTLPR^^ QKVQS 

KV-HLQRDQHKLASMESH-LENQ-KHSLTTL^ 

WKLTF S S S LWMNS Y S D I S S RAMP SNT S FME I S VMDNL AVF I A-PSAHKIHHLN-RIL 

S LWT AQ -KITS- QMRKQVH QNVC VL -LI F YLMTL S R — SHKICQ-FQKWSRLQLTMLKFH 

SCFGVRMDMLKPSTQNYKQVERGNQVLRCL 

MSQSILNCWT-IHLL-LYPTT-ELFTLVLALIK^ 
SSPTHILL-LETVQQYIRLINGTLLLAICMTLGP3SM-Q 

WWL - Li -R- Q S I LGMLTFT S LWAI SHGGQLLLQM-MHHHRKHF - LGLT I LASRRNKLMAI 
PCMLTTFSGGTQILSSCLPIHSLT-ANFLLN-EEL^ 

EKTTELWFQVIFLLTTKRTCLFSYYFLLSLVWTLTGAPLLmFKLLITLNILHL-GGFT 

ILMKFLDQTLFI-LRIYFFHFILMLQGFILLIIRLATLSYLLRMVFILL 

FLVLP-TTSHSR-LLLTILLMLLYEHVTLNCVTT^ 

STYLMPFRLMFQKSQVILNTYESLCLKIKMGFSMFIRAINL-M-FV^ 
CLLVLTLQILEPFLQPFHLLKT^ 

HLLNSNALLRALRLTKEFTRPLISGLFPQEML-DSLILQTC^ 
EKKFLIVLLITLCSTTQHFFQPLSAMAFLPLS-MIFASPMSMQILL-^ 

LVLLLI I 1 INCQMI SWWSLLGILGTLMLLQLVI 1 1 INIGILDMASLGPLRETYLMCLSP 
LMANLAPHLLLIVIGH-MIW^ 

TSVS I L I LMDS LVLVC - LLLQRX)FNHFNNLAVMFLISLIPFEILKHLKY- 
V-LHLEQMLHLKLLFYIKMLTALMFLQQFMQINSHQLGAYILLETMYSRLKQAVL-ELSM 
STLLMSATFLLELAFVLVTIQFLYYWLAKNLLWLILCL-VLIVQLLTLITPLLYLLTFQ 
LALLQK-CLFLWLKPP-IVICTSAEILLWLICFS3>JMVAFAHN-IVHSQVLLLNRIATHV 
KCSLKSNKCTKPQL-NILWLIFHKYYLTL-SQLRGLLLRTCSLIR-HSLMLAS-SNMAN 
A-VILMLEISFVRRSSMDLQCCHLCSLMI-LLPTLLL-LWLPLLDGHLVLALLFKYLLL 
CKWHIGSMALELPKiyiFSMRTKNKSPTNLTRRLVKFKNHLQQHQLHWASCKT 
THLLNNLALILVQFQVC-MISFRDLIKSRRRYKLTG-LQADFK^FKPM-HNN--SGLLKSG 
LLLILLLLKCLSVFLDNQKELTFVERATTLCPSHKQPRMVLSSYMSRMCHPRRGTSPQRQ 
QFVMKAKHTSLVKVFLCLMALLGLLHRGTSFLHK-LLQTIHLSQEIVMSLLASLTTQFMI 
LCNLSLTHSKKSWTSTSKI I HHQML I L ATFQALTLL S S TFKKKLTASMRS LK I -MNHSLT 
FKNWENMSNI LNGLGMFG S ASLLD - LP S SWLQSCF VA- L WAVASRVHALWLAAS LMRM 
TLSQFSRVSNYITHKRTYGFVYEIFYSWINYCTASKN-QCFSCKYCSCYSNDTATSLTPF 
RMACYWRCISCCFSERYQNNCAQ-KMAASPL-GLPVHLQFTAAICYHLFTSFACRCRYGG 
AIFVPLCLDIFSTMHQRM-NYYEMLALLEVQIQEPITL-CQLLCLLAHT-L-LLYTI-QC 
HRYNCRY-R-RHFNTKTQRRLPNWWLF-G-ALRC-RLCRCTWLFHRSLLPA-VYTNYYRH 
WY-KCYILHL-QAC-RPTECANTHNRRLFRSC-SSNGSNL — ADDDY-RAFVSTRK-VRT 
YVLIRFGRNRYVNS — RTSFSCFRGILASHTSHPYCASIVCVLLQYC-REFSKTNGLRLL 
AC-KSELF-RSS-SSGLNELTIIIILFGTLTLLIMADNGTITVEELKQLLEQWNLVIGFL 
FLAWIMLLQFAYSNRl^RFLYIIKLVFLWLLWPVTLACFVLAAVYRINWVTGGIA 
VGLMWLSYFVASFRLFARTRSMWSFNPETNILLNVPLRGTIVTRPLMESELVIGAVIIRG 
HLRMAGHSLGRCDIKDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYNRYRIGNYKLN 
TDHAGSNDNIALLVQ-VTTDVSSC-LPGYNSRDIDYHYEDFQDCYLES-RYNKFNSETII 
-ASN-EELFGVR — RT YGVRL S I KRT - KLF S S - H - L YLHL AS Y I T I RS VLE VRL Y Y - KNL 
AHQEHTRAI HHFTLLLT INLH - LALAHTLLLLVLTVLD I P I S C VQDQFHQNF S SDKRRFN 
KSSTRHFFSLLLL-YF-YFASPLRERQNE-AHFN-LLFVLFSLSAIPCFNNAYYILVFTR 
NPGSRRTLYQSLNEHETSHCFDLYFSMQLHMHCSTALCI — TSCA— RSL-GTTLGVILIA 
LLGFVL-ERFYLFIDGTLWFKHAHLMLLSTVKIQLWRL-LGVGTFMKVTKLLHLETYLL 
F-INEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGLPKNTAS 
WFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFY 
YLGTGPEASLPYGANKEGIVWATEGALNTPKDHIGTRNPl^NAATVLQLPQGTTLPKGF 
YAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDRLNQLE 
SKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDL 
IRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDNVI 
LLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPTVTLLPAADMDDFSRQLQNSM 
SGASAI)STQA-TLMMTTQGRWAM-TFSQFRLRYIVYSCAE-ILVTKQHK-V-LTLISHSN 
L-SMCNIREDLKEPPHFHRGHAEYDRGYSE-C-GELPIWKSPNV-N-F— CYPHVILIAS 
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FIGURE 24 

TQEKPTNLDLL-ICSIJ^ 

VSSVLQSIISIPRFRPGVTER-DGEPCSWCQRENTRPTQFACPSG-RRASAWLRGLCGRG 
PIGGT-TPQKWHLWSSRAGKRRTAPA-TALiCVH-TF-CLiKHQSRPQGR-AGCRNGRHSVR 
S - RYNTG S TRATCGRNPNC I P QC S S S - ER — GSRWS-LWHRSKVLi-LR-RAWH-SH-RL- 
TKLEH-AWQWCTP-THS-AQWRCSHSLCRQQFL 

T-LHRVEERCLLLP-P~A-NCLVH~AL--ELRAPDTLRN-ECQEI-HFQRGMPKVCVSS- 
LKSQSHSTTC - KEKD - GFHGAYTLC VP C C I S TGV- Q YALVYLDEM - S LR - S FMADVRL S E 
SHL-TLVm-KFSY-RTYYMWPTY-CCSENAMSCLSRPRDWT-A-CCRLSQPLKH-NSTP 
QGR-D-MFWRLCVCIjCWLiIj — ACLIjGSSC-C-YWIjRPYWHYW-QCGDIjE---GSP---DTES--T 
C-H-HCWRFSFE-RGCHHFGIFLCFYKCLY-HYKES~LQVFQNHC--VIiR--L-SYQGKARK 
RCLEHWTTEISFNTTWFSLTGCWCYQINFCAHT-CSKPLNS-FAKSSCHHT-WYF--TVI 
T S CRRHGL YFRPAHQQCH Y YG I CNWWS CTTDF SVW- S FGHYC - KTQAYL -MD-GET-CR 
S-ISQGCLGDSQISHYRCF-HRQGSNTGCFR-HQGLCIGy[LH-CC-QGTR]WH-SSHYRWR 
KV"AITQLR-SLHRSKQGTLPSWTWQGAAATTHAS-GTKRSNLS-R-FT-HSTYLi--GGCS 
QER-TRSTRDAR--LHKWSYRRHTSLCKWPHALRD--GQRTIIJRIVSWFTGYKQCLSL1KRG 

CTN-RCNLWRRYCLGSSRLQECENHI-A TC-QSA — KVLCLHC-IRYRSY-VCMCCSR 

GCCEDFTTSF - SPYQHGY- S — VECSYIL,IiI--CW-RKLFITYVI J FLLPSR-GRRGRCRV 
-GRRN--NL-T-VRYRG-LSRSPSGIWCLS-NSSS-GRRRGRLAG-YY-AIRD-ARTRTY 
T~RTS-SVYWLFKTY-QCCH-MC-HR-GGTKC-SYGDCKCC-HTPETWWWCSRCTQQGNQ 
WCHAKGE- — LH-AKWPSYSRRVLFAFWT-SC-EVSACCWT-PKCR-GHPAS-GSI-KFQF 
TGHLTCTIWSRHIWC-TTSVFTSVRADGSYTGIiYCSQ-QSSL-AGCHGLS — PEA-SGS 
T-TRGATKHRRFQN-GEICRTEACRCEAKN-GLH — GYHNTGRN-VSYQ-VTLVC-YQW- 
AliP-FSEHA-R-RWFP-EGCTLHGR-CYH-W-YHLCCNTLQKGWWHY-DAIjKSFEESAS 
--WNHVPWTRMCWLYT-GS-DCS-EMQICILCTTFRST-C-GRDSRNCILEFERNACSC 

-RDKKINAYMHGG-SHNGNHPT-V— RN-NSRGHR-LWCPIIjIjIjY RACSFYYYEAELSK 

- AACHNANWLCDTWF - S - RGC AL YAF S - S S CRS VS 1 1 TRCC YYI - WI PHF VI KDI - GALC 
RNSFFGWLLQRLVLFRTAYRVRC-IS-AW-QNCVPHSGEPRRVSS-R-GSFT-QTKESLI 
PAGG-DYKSVHNCGQH-SPHTACGYWDIWTAVWSNILGWC-CYKN-TSCKS-G-DFLCT 

T HTT — SFRVLPYS EFSW-VHVCFKPHKEMEISSSWWFNFN-MG QKLFV-CFI 

STSTA-SQIQCTSTSRGLL-SPCW-CC-IjLCTHTRLQ — NC WRAW- C QRNYDP S S TAC - F 
GICKASS~CGV-TL J WSENYYL I NGCRSCDWGYSIL--S~DRCFHSMCVWS-CYTISSTTR 
VFFCYDVCTTC-V-ITARYILMCE-VHW~LSVWSLHSYNC---GDPIiSY-RSSPYKDVRVQR 
TSD-CFLQGNILHYNHQACW-TRWSYLHRD-TKIGWVL-KG-CLLYRAAYRPCTNSTIT 
KCEF~~FQTHMF-HKIC--FKSNDRI j HKASFTRAICHILiPRIjEWRCSGY--I j -TI j FSEFQE 
RC-ITA-ANCLAH-PGYNQDWQTKHLVFTLSLEYKASRYFKFI-SSGSRRHTRNGQSCL 
-KSTTHD-RSSGKSYHTEGSHRV-RENYRSCRQCHT-TIR-RC-SNTRVRS-GSYGCLCG 
KHKHYH-ET--AFTSLRFKNNCHSWYCCN — CSLE-NFGLCQTILRTSSNYNIKLR-EIS 
TTCV-QLYALCWIIVPIVYFY-KYQF-N-SFTTYNYC-K-C-ECC-IMFGCRH--IJCEVT 
QIF-IVHNRWAIWKYLLRFSNLCNCCFWCTLI-FWCSFLL-WR-RIVS-FV-RYYYGF 
L-RFFSDQHLFKWIRLP-FLSSS-NHSGDDFIVQARLDNFRSGR-VGFGIYWHKILLiFI 
RSFSYNAGVmiiFC-SFHQQFLAHWYH-YCTNGTRFCNG-DVHLLCFFIjIjHMEELCSYH 
GWLHLFDLHDVL-AQSCHTR-VYN 

-YIH SCS-FVTPV-KTNQPY-PVIVYC — CCCEKWRASPLL-QGWSKDLi-ETSALPF 

CQFRQFES-QH-RFTAY-CHSF-WQVQMRRVCF-VCFCVLQSADVPTYSVA-PSSCIRRW 
R-Y-SFR-DV-CLCRHLFSNF-CSYGKT-GTCCYSSQRVSKGCSFRWCPFYIRVSCPTRC 
C~YRC-HKGCY-MSQTFTSL,-LRSDR-QL-QFHAHL — G - KHDAQRS WRMY - L - C KAYQC 
PSSKKSQCFTHLECKRLHVFI-TAA-TNS-CCQEEQHTF-TNIjCYN-TGCQCHISrY-NIjTQ 
GW-DC-YLF-TYA-GHIIVRSCCIGLLYRYASTYIWP-WLHK-NHWLQSHSGWCHS-HH 

FY LFCK-TCWF-RW-PAWWFIQK-QKLPCSSCYHYKRDWFHSAWLTGYCAESNQW-Li 

LAFSTSCF-CCWQHL1L1HTFQTH-V — FCYLCIjRSCC-VYNF-GCYGQTCAILjIj-H-FARG 
FYFL--ASSRHSLCAYGWFHHTVS-HLPGGFC-SS]SnsrF-C-VI J ~TWYMRKVRSRYLPIYQ 

W-MGS ALQSSIRSFLWC-CDESHS-HLYSSCATCGCFRCVCFSSGWWYYCHIGDLCC 

LLLYEIQTCFW-VQPCCCC-CTFVFDVFHYTLSGTSLQLSAGSLLSLLLVLDIDFHQ-CF 
ILGSPSWCHVFSYCAFLDNSNLCILYFSEALiPLVL-QI J S-EKSHV-WSYI--YLRGGCFV 
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YLFAQQGWPKIA-RDTVATYTV-QVSCSI-QVQVFQWSLRYYQLS-SSLLPLSKGSK-L 
-QLRC-CSLPTTTDINHFCCSAEWF-ENGIPVRQS-RVHGTSNLWNYNS-WIWG--HSID 
SKTCHLHSRRHA- S -L -RSAHSQI QP-LSCSGWQCSTSC YWPFYAKLSA- A- S - YF -P-D 
TQV-ICPYPTWSNIFSSSMLQWFTIWCL 

FLLYASYGASNRSTRWY-LiRR-ILWSIC-QTNCTGCRYRHNHNIKCFGMAVCCCYQW — V 

VS — IHHYFE-L-PCGNEVQL-TFDTRSC-HIGTSFCSNRNCRLRWCCFERAAAEWYEW 

SYYPW-HYFRR-WTI-CC-TMLWCYLPR-VQENC-GHSSDDAFNFLDITIDSCSKYTW 

TVFLCLRECFLAIYSWYYGNCCMCYAAC-A-ARILVLVSVTFSCNSCLIj-YGLHAC-LGD 

AYHDMA-IG— H-LVWL-A-GLrCYVCFSFSFAYSHDSSHCLi — CC-TCLDTDECHYTCLiQS 

LLW-CFRSSYFHVGLSYFCNIj-LFWCRYDY]^ 

CLLFLRLLLLLLLWPPLPTQPLIiQAYSWCL-IiLGL 

H-WGYWR-TMYQGCYCTV-W-RKVHICGTALGSSTT--SRVIF--IVGTMCTTPQ-YSSC 
KRHN-SFREDGFSFVCFAIHAGCCRH — WRGNAR- PCYSSGYCFRI -FFTI ICRLCHCP 
GGLi-AGCS-W-F-SRSQKVKEIFECG-I-V-P-CCHATQVGKDGRSGYDPNVQTGKI-GQ 

EGKSN-CYAMNALHYA-EA CT-QHYQQCA-WLCSTQHHTIDYSSQTHGCCP-IiWYIiQ 

EHL-W-HLYICICTLGNPASC-CG-QDCST--N-HGQFTKFGLASYCYSSKSQLSC-TTE 
— TESSSTTTDVLCGWYHTNSLiY — QCTCLL-QFEGR-VCAGITIRPPRSQMG-I P-E-W 
YRYNLHRTGTTL-VCYRHTKRA-SEILVLHQRLKQPK-RYGAGQFSCYSTSSGWKCYRST 
CQFNCAFLLCFCSRPC - S I -GLPSKWRTTNHQDCEDWYTHWYRTGNYCNTRS -HGPRVL 
WWCFMDSVL— MPH-PSKS-RIIj-LER— VRPI5TTYHI1C — PSGFYT-KHSLYRLiRNVERLWIj 
-L-PTPRTLDAVCGCINVFKRVCGVSAARLTPCGTGTSTDWYRAFDIYNEKVAGFAKFL 
KTNCCRFQEKDEEGNLLDSYFWKRHTMSNYQHEETIYNLVKDCPAVAVHDFFKFRVDGD 
MVPH I SRQRDTKYTMADLVYALRHFDEGNCDTLKE I LVTYNCCDDDYFNKKDWYDFVENP 
DILRWANLGERVRQSLLKTVQFCDAM^^ 
GVPIVDSYYSLLMPILTLTRALAAESHM^ 

'WDQTYHPNCINCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDGVPFWSTGYHFR 
ELGWHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTT^ 

VKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQL. 
LFWEWDKYFDCYDGGCINMQVIVMfLDKSAGFPFNKWGKARLYYDSMSYEDQDALFA 
YTKRNVI PTI TQMNLKYAI SAKNRARTVAGVS ICSTMTNRQFHQKLLKS IAATRGATWI 
GTSKFYGGWHNMLKTVYSDVETPHL^ 

HRF YRLr ANE C AQ VL S EMVMC GG S L YVKPGGT S S GD ATT AYAN S VFN I C Q AVT ANVNAL L S' 
TDGNKIADKWRNLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSMILSDDAWCYNSN 
YAAQGLVASIKNFKAVIjYYQNNVFMSEAKCWTETDL 

PYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIR 
KLHDELTGHMLDMYSVMIjTNDNTSRYWEPEFYEAMYTPHTVIjQAV 

ACIRRPFLCCKCCYDHVISTSHKLVIjSVNPYVCNAPGCDVTDVTQIjYIjGGMSYYCKSHKP 

pisfplcangqvfglykntcvgsdw^ 

IiKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNWFTGYRVTKNSKVQIGE 

YTFEKGDYGDAWYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLN 

ISDEFSSWANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDAL 

CEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTWALPETTADIWFDEIS 

MATNYDLSVVNARLRAKHYWIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGP 

LGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGWR 

EFDTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCN 

WRFOTAITRAKIGILCIMSDRDLYDKLQFTSLEIPRRWATLQAENVTGLFKDCSKIIT 

GLH PTQAPTHL SVD I KFKTEGLC VD I PG I PKDMT YRRL I SMMGFKMNYQWGYPNMF I TR 

EEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 

NAKPPPGDQFKHLIPLMYKGLPWNVWIKIVQMLSDTLKGLSDRWFVLWAHGFELTSM 

YFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVGFDYVYNPFMIDVQQWGFTGNLQSN 

HDQHCQVHGNAWASCDAIMTRCLAVHECFVKRVDWSVEYPIIGDELRWSACRKVQHMV 

VKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDK 

FTDGVCLFWNCIWimYPANAIVCRFD^ 

TNLKQLPFFYYSDSPCESHGKQWSDIDYVPLKSATCITRCNLGGAVCRHHANEYRQYLD 
AY3MMISAGFSLWIYKQFDTYNLMTFTRLQSLEWAY3WWKGHFDGHAGEAPVSIIM 
AVYTKVDG IDVE I FEJSTKTTL PVNVAFELWAKRNI KP VPE I K I LNNDGVD I AANTVI WD YK 
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RE APAHVST I GVCTMTDI AKKPTE SAC S SLTVLFDGRVEGQVDLFRNARNGVLi I TEGS VK 

GIjTPSKGPAQASVNGVTLIGESVKTQFNYFKKVIDGIIQQIjPETYFTQSRDIjEDFKPRSQM 

ETDFLELAMDEFIQRYKLEGYAFEHIWGDFSHGQLGGLHLMIGLAKRSQDSPLKLrEDFI 

PMDSTVKNYF I TDAQTGS SKCVCSVIDLKLDDFVE I IKSQDLSVI SKWKVTIDYAEI SF 

MLWCKDGHVETFYPKLQASRAWQPGVAMPN 

VAKYTQLCQYLNTDT^^ 

VSDAYSTLIGDCATVHTANKWDLIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLAL 
GGS I AVK I TEH S WNADL YKLMGHF S WWTAFVTNVNAS S S E AFL I GANYLGKPKEQ I DG YT 
MHANYIFWRNTNPIQLSSYSLFDMSKFPL^^ 

ENNRVWS S D I LVNN- TNMF I FLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYY 
PDEIFRSDTLYLTQDLFLPFYSlWTGFHTINHTFGNPVIPFKDGIYFAATEKSlSr^RGW 
FGSTMNNKSQSVIIIN^ 

YISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDWRDLPSGFNTLKPIFKL 
PLG INI TNFRAI LTAF S P AQD I WGT S AAAYFVG YLKPTTFMLKYDENGT I TDAVDC S QNP 
LAELKCSVKSFEIDKGIYQTSNFRWPSGDWRFPNITNLCPFGEVFNATKFPSVYAWER 
KKISNCVADYSVLYNSTFFSTFKCYGVSATKmDLCFSlWYADSFVVKGDDVRQIAPGQT 
GVIADYNYKLPDDFMGCVIjAWNTRNIDATSTGNYNYKYRYIjRHGKIjRPFERDI SNVPFS P 
DGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWLSFEIjIjNAPATVCGPKLSTDLIKN 
QCWFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSEILDISPCAFGGVS 
VITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNWFQTQAGCLIGAEHV 
DTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTNFSI 
S ITTEWPVSMAKTS VDCNMYI 

VFAQVKQMYKTPTLKYFGGFNF SQIL PDPLKPTKRS F I EDLLFNKVTL ADAGFMKQ YGE C 

LiGD INARDL I C AQKFNGLTVLP PLLTDDMI AAYTAALVS GTATAGWTFGAGAALQ I PFAM 

QMAYRFNGIGVTQNVLYENQKQ I ANQFNKAI SQ I QESLTTTSTALGKLQDWNQNAQALjN 

TLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTWTQQLIRAAEIRA 

S ANIj AATKMS EC VLiGQ S KRVDF C GKG YHLMS F P Q AAPHG WFLHVT YVP S QERNFTT AP A 

ICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQI ITTDNTFVSGNCDWIGI INNTVYDP 

LQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVWIQKEIDRLNEVAKNLNESLID^ 

QELGKYEQYIKWPWYVWIjGFIAGIjIAIVMVTIIjIjCCMTSCCSCDKGACSCGSCCKFDEDD 

SEPVLKGVKLHYT-TNLWICL^ 

DGDLLALHFLLFFRAL^ 

NFCTSMP-YIFYNASTHVELL-DVGFVGSANPRTHYFlvnyEPTTIiFAGTHITMTTVYHITVS 
QIQDSLLKVTAFQHQNSKKTT^^ 

VLKMLHSSSLTSLLKTHRMCKYTQSTALQELLIQQWIQFMMSRRRL 

CTHSFRKKQVR--LIAYFFFLLSWYSC-SH-PSLLRFDCVRTAAILLT-V--NQRFTSTR 
VLKI-TLLKEFLIFWSKRTNYYYYSVWNFNIAYHGRQRYYYR-GA-TTPGTMEPSNRFPI 
PSLDWTTICLF-SEQVFVHNKACFPLALVASNTCLFCACCCLQN-LGDWRDCDCNGLiYC 
RLDVA-LLRCFLQAVCSYPLNWIQPRJSTKHSSQCASPGDNCDQTAHGK-TCHWCCDHSW^ 
LANGRTLPRAL-H-GPAKRDHCGYITNAFLLQIRSVAACRH-FRFCCIQPDPYWKL-IKY 
RPRR-QRQYCFASTVSDNRCFILLTSRLQ-QRY-LSL-GLSGLLFGILTL — VQ — DNYL 
SL-LRRI IRS -MMKNLWS - 1 IHKTNMKI ILFLTLIVFTSCELYHYQECVRGTTVLiLKEPC 
PSGTYEGNSPFHPLADNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKIjFIRQEEVQQ 
ELYSPLFLIVAALVFLILCFTIKRKTE-MSSL-LTSICAF-PFCYSLiF — CLLYFGFHSK 
SRI- KNL VPK S KRT - NF S LF - L VF L YAVAYALi - Y SAVHL I NLMC LK I LVRYNTRGNT Y S T 
AWLCALGKVLPFHRWHTWQTCTP1WTINCQDPAGGALIARCWYLHEGHQTAAFRDVL 
LNKRTN-NV — WTPIKPT-CPPHYIWWTHRFN-Q-PEWRTQWGKAKTAPTPRFTQ-YCVL 
VHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-PNWL^ 

PRNWPRSFTSLRR-QRRHRMGCN--GSLEYTQRPHWHPQS — QCCHRATTSSRNNIAKRLL 
RRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG--RRW-NCPRAIAARQIEPA-E 
QSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPN 
QTRN- LQTLAANCT I C SKCLC I LWWTHWHGSHTFGMyjADLSWSH-IG-QRSTIQRQRHT 
AEQAH-RIQNIPTNRA-KGQKEKD — S S AFAAETKEAAHCDS S SCG-HG- FLQTTSKFHE 
WSFC-FNSGINTHDDHTRQMGYVWFAIPFTIHSL^ 

LINV-H-GGLERATTFSSRPRGVRSRVQ-IMLGRAAYMEEP-CVKLILWLSPCDFNSFL 
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FIGURE 25 

PTPRTLDAVCGCINVFKRVCGVSAARLTPCGTGTSTDWYRAFDIYNEKVAGFAKFLKTNCCRFQE 

KDEEGNLLDSYFWKRHTMSNYQHEETIYNLWDCPAVAWDFFKFRVDGDMVPHISRQRLTKYTM 

ADLWALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTV 

QFCDAMRDAGIVGVLTLDNQDLNGEn^^ 

MDADLAKPLIKWDLLKYDFTEERLCL^ 

PTSFGPLVRKIFVDGVPFWSTGYHFRELGVVHNQDWLHSSRLSFKELLWAADPAMHAASG 

LDKRTTCFSVAALT3N1WAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYD 

YYRYNLPTMCDIRQLLFWEVVDKYFDCYDGGCINANQVIVNNLDK 

YEDQDALF AYTKRNVI PT I TQMNLKYAI S AKNRARTVAGVS I C STMTNRQFHQKLLKS I AATRGAT 

WIGTSKFYGGWHNMLKTW 

YRLANECAQVLSEMVMCGGSLYVKPGGT^ 

YVRNLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSMMILSDDAVVCYNSNY 

VLYYQN3WFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYWLPYPDPS 

DGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIR 

EPEFYEA3VTYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVC 
NAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTN 
AGDYILANTCTERLKLFAAETLK^^ 

TGYRVTKNSKVQIGEYTFEKGDYGDAW^RGTTTYKLWGDYFVLTSHTVMPLSAPTLVPQEHYVR 
ITGLYPTLNISDEFSSIWAlSnirQKVGMQKYSTLQGPPGTGKSHFATGLALYYPSARIVYTACSHAA^ 
DALCEKALKYLPIDKCSRIIPARARVECFDKFKWSTLEQWFCTWALPETTADIWFDEISMAT 
NYDLS WNARLRAKHYVY I GDP AQLPAPRTLLTKGTLE PE YFNS VCRLMKT I GPDMFLGTCRRC PA 
EIVDTVSALWDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGWREFLTRNPAWRKAVFI 
SPYNSQNAVASKILGLPTQTVDSSQGSEYDWIFT^ 

RDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGLCVDIP 
GI PKDMTYRRLI SMMGFKMSTYQW PLQL 
GFSTGVNLVAVPTGYVDTENNT 

GLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVGFDYVYNPFM, 

IDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWSVEYPIIGDELRVNS- 

ACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATH 

HDKFTDGVCLFWNCNYDRYPANAIVCRFDTRVLSNLNLPGCDGGSLYWKHAFHT 

KQLPFFYYSDSPCESHGKQWSDIDWPLKSATCITRCNLGGAVCRHHANEYRQYLDAYISnyiMISAG 

F S L WI YKQ FDT YNLWNTF TRL Q S LE3WAYNVVNKGHF DGHAGE AP VS I INNAVYTKVDGIDVEIFE 

NKTTL PVNVAF EL WAKRN I KPVPE I K I LNNLGVD I AANT V I WD YKRE AP AHVS T I GVCTMTD I AKK 

PTESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIGESVKTQF 

JNT^FKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKLEGYAFEHIVYGDFSH 

GQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKS 

QDLSVISKWKVTIDYAEISFMLWCKDGHVETFYPKLQASRAWQPGVAMPNLYKMQRMLLEKCDLQ 

NYGENAVIPKGIMMIWAK^ 

LVDSDLNDFVSDAYSTLIGDCATVHTANKWDLIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQK 

LALGGSIAVKITEHSWNADLYK^ 

NYIFWRNTNPIQLSSYSLFDMSKFPLK^^ 

DILVNN*Tm^ 

LFLPFYSIWTGFHTINHTFGNPVIPFKDGIYFAATEK^ 

VIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISDAFSLDVSEKSGNFKHLREFVFKN 
KDGFLYVYKGYQPIDWRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVG 
YLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQTSNFRWPSGDWRFPNITN 
LCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSF 
VVKGDDVRQIAPGQTGVIADYJNTZKLPDDFMGCV^ 

DISWPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWLSFELLNAPATVCGPKLSTDL 
IKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSEILDISPCAFGGVSVIT 
PGTNASSEVAVLYQDWCTDVSTAIHADQLT^ 

IGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSl^TIAIPTNFSISITTEVMPVSMAKTS 
VDCNMYICGDSTECANLLLQYGSFCT^ 

FSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTD 
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DMIAAYTAZ^IjVSGTATAGWTF 

I QE S LTTTSTALGKLQDWNQNAQALNTLVKQL S SNFGAI S SVLND I L S RLDKVE AEVQ IDRL I TG 
RLQSLQTWTQQLIRAAEIRA^ 

TYVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGNCDWIG 
IINNTVYDPLQPELDSFKEELDK^ 

IDLQELGKYEQYIKWPWYWLGFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEP 
VLKGVKLHYT* 



FIGURE 26 



FIGURE 26A 



FIGURE 26B 
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FIGURE 27 
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FIGURE 29 
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FIGURE 31 
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FIGURE 34 
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FIGURE 35 
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FIGURE 36B 
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FIGURE 36C 
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FIGURE 37 
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AAGCTTACAAAACAAA 



D D V Q A 
GAT GAT GTT CAA GCT 
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R G V Y Y 
AGG GGG GTT TAG TAT 
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TAT TTA ACT CAG GAT 
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G F H T I 
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TTT AAG GAT GGT ATT 
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V R G W V 
GTC CGT GGT TGG GTT 
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TCG GTG ATT ATT ATT 
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C N F E Ij 
TGT AAC TTT GAA TTG 



P M G T Q 
CCC ATG GGT ACA CAG 

150 

N C T F E 
AAT TGC ACT TTC GAG 



S E K S G 
TCA GAA AAG TCA GGT 
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K N K D G 
AAA AAT AAA GAT GGG 
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CGA GAG TTT GTG TTT 
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AAG GGC TAT CAA CCT 



WO 2004/092360 



113/193 



PCT/US2004/011710 



I D V V R 
ATA GAT GTA GTT CGT 

210 

P I F K L 
CCT ATT TTT AAG TTG 



A I L T A 
GCC ATT CTT ACA GCC 

240 

S A A A Y 
TCA GCT GCA GCC TAT 



M L K Y D 
ATG CTC AAG TAT GAT 

270 

C S Q N P 
TGT TCT CAA AAT CCA 



F E I D K 
TTT GAG ATT GAC AAA 

300 

V P S G D 
GTT CCC TCA GGA GAT 



C P F G E 
TGT CCT TTT GGA GAG 

330 

Y A W E R 
TAT GCA TGG GAG AGA 



S V L Y N 
TCT GTG CTC TAC AAC 

360 

G V S A T 
GGC GTT TCT GCC ACT 



Y A D S F 
TAT GCA GAT TCT TTT 

390 

A P G Q T 
GCG CCA GGG CAA ACT 
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D L P S G 
GAT CTA CCT TCT GGT 



P L G I N 
CCT CTT GGT ATT AAC 

230 

F S P A Q 
TTT TCA CCT GCT CAA 



F V G Y L 
TTT GTT GGC TAT TTA 

260 

E N G T I 
GAA AAT GGT ACA ATC 



L A E L K 
CTT GCT GAA CTC AAA 

290 

G I Y Q T 
GGA ATT TAC CAG ACC 



V V R F P 
GTT GTG AGA TTC CCT 

320 

V F N A T 
GTT TTT AAT GCT ACT 



K K I S N 
AAA AAA ATT TCT AAT 

350 

S T F F S 
TCA ACA TTT TTT TCA 



K L N D L 
AAG TTG AAT GAT CTT 

380 

V V K G D 
GTA GTC AAG GGA GAT 



G V I A D 
GGT GTT ATT GCT GAT 



F N T L K 
TTT AAC ACT TTG AAA 

220 

I T N F R 
ATT ACA AAT TTT AGA 



D I W G T 
GAC ATT TGG GGC ACG 

250 

K P T T F 
AAG CCA ACT ACA TTT 



T D A V D 
ACA GAT GCT GTT GAT 

280 

C S V K S 
TGC TCT GTT AAG AGC 



S N F R V 
TCT AAT TTC AGG GTT 

310 

N I T N L 
AAT ATT ACA AAC TTG 



K F P S V 
AAA TTC CCT TCT GTC 

340 

C V A D Y 
TGT GTT GCT GAT TAC 



T F K C Y 
ACC TTT AAG TGC TAT 

370 

C F S N V 
TGC TTC TCC AAT GTC 



D V R Q I 
GAT GTA AGA CAA ATA 

400 

Y N Y K L 
TAT AAT TAT AAA TTG 
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P D D F M 
CCA GAT GAT TTC ATG 

420 

I D A T S 
ATT* GAT GCT ACT TCA 



L R H G K 
CTT AGA CAT GGC AAG 

450 

V P F S P 
GTG CCT TTC TCC CCT 



N C Y W P 
AAT TGT TAT TGG CCA 

480 

G I G Y Q 
GGC ATT GGC TAC CAA 



L L N A P 
CTT TTA AAT GCA CCG 

510 

D L I K N 
GAC CTT ATT AAG AAC 



T G T G V 
ACT GGT ACT GGT GTG 

540 

F Q Q F G 
TTT CAA CAA TTT GGC 



R D P K T 
CGA GAT CCT AAA ACA 

570 

F G G V S 
TTT GGG GGT GTA AGT 



E V A V L 
GAA GTT GCT GTT CTA 

600 

T A I H A 
ACA GCA ATT CAT GCA 



410 

G C V L A 
GGT TGT GTC CTT GCT 



T G N Y N 
ACT GGT AAT TAT AAT 

440 

L R P F E 
CTT AGG CCC TTT GAG 



D G K P C 
GAT GGC AAA CCT TGC 

470 

L N D Y G 
TTA AAT GAT TAT GGT 



P Y R V V 
CCT TAC AGA GTT GTA 

500 

A T V C G 
GCC ACG GTT TGT GGA 



Q C V N F 
CAG TGT GTC AAT TTT 

530 

L T P S S 
TTA ACT CCT TCT TCA 



R D V S D 
CGT GAT GTT TCT GAT 

560 

S E I L D 
TCT GAA ATA TTA GAC 



V I T P G 
GTA ATT ACA CCT GGA 
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Y Q D V N 
TAT CAA GAT GTT AAC 



D Q L T P 
GAT CAA CTC ACA CCA 



W N T R N 
TGG AAT ACT AGG AAC 

430 

Y K Y R Y 
TAT AAA TAT AGG TAT 



R D I S N 
AGA GAC ATA TCT AAT 

460 

T P P A L 
ACC CCA CCT GCT CTT 



F Y T T T 
TTT TAC ACC ACT ACT 

490 

V L S F E 
GTA CTT TCT TTT GAA 



P K L S T 
CCA AAA TTA TCC ACT 

520 

N F N G L 
AAT TTT AAT GGA CTC 



K R F Q P 
AAG AGA TTT CAA CCA 

550 

F T D S V 
TTC ACT GAT TCC GTT 



I S P C S 
ATT TCA CCT TGC TCT 

580 

T N A S S 
ACA AAT GCT TCA TCT 



C T D V S 
TGC ACT GAT GTT TCT 

610 

A W R I Y 
GCT TGG CGC ATA TAT 



WO 2004/092360 



115/193 



PCT/US2004/011710 



620 

STGNNVFQTQAGCLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 
GAE HVD T • S Y E C D I P I 
GGA GCT GAG CAT GTT GAT ACT TCT TAT GAG TGC GAC ATT CCT ATT 

650 

G.AGICASYHTOC SEQ ID NO: 9799 

GGA GCT GGC ATT TGT GCT AGT TAC CAT AC A TAA TGAGTCGAC SEQ ID NO: 9800 

Translated Mol . Weight = 72525 . 52 



AAGCTTACAAAACAAA 



FIGURE 66 

1 10 
MSDLDRCTTF 
ATG AGT GAC CTT GAC CGG TGC ACC ACT TTT 



20 

DDVQAPNYTQHTSSM 
GAT GAT GTT CAA GCT CCT AAT TAC ACT CAA CAT ACT TCA TCT ATG 

30 40 
RGVYYPDE I FRSDTL 
AGG GGG GTT TAC TAT CCT GAT GAA ATT TTT AGA TCA GAC ACT CTT 

50 

Y Xj t q d lflpfysn'vt 

TAT TTA ACT CAG GAT TTA TTT CTT CCA TTT TAT TCT AAT GTT ACA 

60 70 
GFHTINHTFGNPVIP 
GGG TTT CAT ACT ATT AAT CAT ACG TTT GGC AAC CCT GTC ATA CCT 

80 

FKDGIYFAATEKSNV 
TTT AAG GAT GGT ATT TAT TTT GCT GCC ACA GAG AAA TCA AAT GTT 

90 100 
VRGWVFG S TMNNKSQ 
GTC CGT GGT TGG GTT TTT GGT TCT ACC ATG AAC AAC AAG TCA CAG 

110 

SVIIINNSTNVVIRA 
TCG GTG ATT ATT ATT AAC AAT TCT ACT AAT GTT GTT ATA CGA GCA 

120 130 
CNFELCDNPFFAVSK 
TGT AAC TTT GAA TTG TGT GAC AAC CCT TTC TTT GCT GTT TCT AAA 

140 

PMGTQTHTMI FDNAF 
CCC ATG GGT ACA CAG ACA CAT ACT ATG ATA TTC GAT AAT GCA TTT 

150 160 
NCTFEYI SDAFSLDV 
AAT TGC ACT TTC GAG TAC ATA TCT GAT GCC TTT TCG CTT GAT GTT 
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S E K S G 
TCA GAA AAG TCA GGT 

180 

K N K D G 
AAA AAT AAA GAT GGG 



I D V V R 
ATA GAT GTA GTT CGT 

210 

P I F K L 
CCT ATT TTT AAG TTG 



A I L T A 
GCC ATT CTT ACA GCC 

240 

S A A A Y 
TCA GCT GCA GCC TAT 



M L K Y D 
ATG CTC AAG TAT GAT 

270 

C S Q N P 
TGT TCT CAA AAT CCA 



f e: I D K 

TTT GAG ATT GAC AAA 

300 

V P S G D 
GTT CCC TCA GGA GAT 



C P F G E 
TGT CCT TTT GGA GAG 

330 

Y A W E R 
TAT GCA TGG GAG AGA 



S V L Y N 
TCT GTG CTC TAC AAC 

360 

G V S A T 
GGC GTT TCT GCC ACT 



Y A D S F 
TAT GCA GAT TCT TTT 

390 

A P G Q T 
GCG CCA GGG CAA ACT 



170 

N F K H L 
AAT TTT AAA CAC TTA 



F L Y V Y 
TTT CTC TAT GTT TAT 

200 

D L P S G 
GAT CTA CCT TCT GGT 



P L G I N 
CCT CTT GGT ATT AAC 

230 

F S P A Q 
TTT' TCA CCT GCT CAA 



F V G Y L 
TTT GTT GGC TAT TTA 

260 

E N G T I 
GAA AAT GGT ACA ATC 



L A E L K 
CTT GCT GAA CTC AAA 

290 

G I Y Q T 
GGA ATT TAC CAG ACC 



V V R F P 
GTT GTG AGA TTC CCT 

320 

V F N A T 
GTT TTT AAT GCT ACT 



K K I S N 
AAA AAA ATT TCT AAT 

350 

S T F F S 
TCA ACA TTT TTT TCA 



K L N D L 
AAG TTG AAT GAT CTT 

380 

V V K G D 
GTA GTC AAG GGA GAT 



G V I A D 
GGT GTT ATT GCT GAT 



R E F V F 
CGA GAG TTT GTG TTT 

190 

K G Y Q P 
AAG GGC TAT CAA CCT 



F N T L K 
TTT AAC ACT TTG AAA 

220 

I T N F R 
ATT ACA AAT TTT AGA 



D I W G T 
GAC ATT TGG GGC ACG 

250 

K P T T F 
AAG CCA ACT ACA TTT 



T D A V D 
ACA GAT GCT GTT GAT 

280 

C S V K S 
TGC TCT GTT AAG AGC 



S N F R V 
TCT AAT TTC AGG GTT 

310 

N I T N L 
AAT ATT ACA AAC TTG 



K F P S V 
AAA TTC CCT TCT GTC 

340 

C V A D Y 
TGT GTT GCT GAT TAC 



T F K C Y 
ACC TTT AAG TGC TAT 

370 

C F S N V 
TGC TTC TCC AAT GTC 



D V R Q I 
GAT GTA AGA CAA ATA 

400 

Y N Y K L 
TAT AAT TAT AAA TTG 
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P D D F M 
CCA GAT GAT TTC ATG 

420 

I D A T S 
ATT GAT GCT ACT TCA 



L R H G K 
CTT AGA CAT GGC AAG 

450 

V P F S P 
GTG CCT TTC TCC CCT 



N C Y W P 
AAT TGT TAT TGG CCA 

480 

G I G Y Q 
GGC ATT GGC TAC CAA 



L L N A P 
CTT TTA AAT GCA CCG 

510 

D L I K N 
GAC CTT ATT AAG AAC 



T G T G V 
ACT GGT ACT GGT GTG 

540 

F Q Q F G 
TTT CAA CAA TTT GGC 



R D P K T 
CGA GAT CCT AAA ACA 

570 

F G G V S 
TTT GGG GGT GTA AGT 



E V A V L 
GAA GTT GCT GTT CTA 

600 

T A I H A 
ACA GCA ATT CAT GCA 



S T G N N 
TCT ACT GGA AAC AAT 

630 

G A E H V 



410 

G C V L A 
GGT TGT GTC CTT GCT 



T G N Y N 
ACT GGT AAT TAT AAT 

440 

L R P F E 
CTT AGG CCC TTT GAG 



D G K P C 
GAT GGC AAA CCT TGC 

470 

L N D Y G 
TTA AAT GAT TAT GGT 



P Y R V V 
CCT TAC AGA GTT GTA 

500 

A T V C G 
GCC ACG GTT TGT GGA 



Q C V N F 
CAG TGT GTC AAT TTT 

530 

L T P S S 
TTA ACT CCT TCT TCA 

R D V S D 
CGT GAT GTT TCT GAT 

560 

S E I L D 
TCT GAA ATA TTA GAC 



V I T P G 
GTA ATT ACA CCT GGA 

590 

Y Q D V N 
TAT CAA GAT GTT AAC 



D Q L T P 
GAT CAA CTC ACA CCA 

620 

V F Q T Q 
GTA TTC CAG ACT CAA 



D T S Y E 



W N T R N 
TGG AAT ACT AGG AAC 

430 

Y K Y R Y 
TAT AAA TAT AGG TAT 



R D I S N 
AGA GAC ATA TCT AAT 

460 

T P P A L 
ACC CCA CCT GCT CTT 



F Y T T T 
TTT TAC ACC ACT ACT 

490 

V L S F E 
GTA CTT TCT TTT GAA 



P K L S T 
CCA AAA TTA TCC ACT 

520 

N F N G L 
AAT TTT AAT GGA CTC 



K R F Q P 
AAG AGA TTT CAA CCA 

550 

F T D S V 
TTC ACT GAT TCC GTT 



I S P C S 

ATT TCA CCT TGC TCT 

-■< 

580 

T N A S S 
ACA AAT GCT TCA TCT 



C T D V S 
TGC ACT GAT GTT TCT 

610 

A W R I Y 
GCT TGG CGC ATA TAT 



A G C L I 
GCA GGC TGT CTT ATA 

640 

C D I P I 
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GGA GCT GAG CAT GTC 



G A G I C 
GGA GCT GGC ATT TGT 

660 

S T S Q K 
AGT ACT AGC CAA AAA 



A D S S I 
GCT GAT AGT TCA ATT 

690 

T N F S I 
ACT AAC TTT TCA ATT 



MART S 
ATG GCT AAA ACC TCC 

720 

S T E C A 
TCT ACT GAA TGT GCT 



T Q L N R 
ACA CAA CTA AAT CGT 

750 

R N T R E 
CGC AAC ACA CGT GAA 



T P T L K 
ACC CCA ACT TTG AAA 

780 

L P D P L 
TTA CCT GAC CCT CTA 



L L F N K 
TTG CTC TTT AAT AAG 

810 

Q Y G E C 
CAA TAT GGC GAA TGC 



C A Q K F 
TGT GCG CAG AAG TTC 

840 

T D D M I 
ACT GAT GAT ATG ATT 



T A T A G 
ACT GCC ACT GCT GGA 



GAC ACT TCT TAT GAG 

650 

A S Y H T 
GCT AGT TAC CAT ACA 



S I V A Y 
TCT ATT GTG GCT TAT 

680 

A Y S N N 
GCT TAC TCT AAT AAC 



S I T T E 
AGC ATT ACT ACA GAA 

710 

V D C N M 
GTA GAT TGT AAT ATG 



N L L L Q 
AAT TTG CTT CTC CAA 

740 

A L S G I 
GCA CTC TCA GGT ATT 



V F A Q V 
GTG TTC GCT CAA GTC 

770 

Y F G G F 
TAT TTT GGT GGT TTT 



K P T K R 
AAG CCA ACT AAG AGG 

800 

V T L A D 
GTG ACA CTC GCT GAT 



L G D I N 
CTA GGT GAT ATT AAT 

830 

N G L T V 
AAT GGA CTT ACA GTG 



A A Y T A 
GCT GCC TAC ACT GCT 

860 

W T F G A 
TGG ACA TTT GGT GCT 



TGC GAC ATT CCT ATT 



V S L L R 
GTT TCT TTA TTA CGT 

670 

T M S L G 
ACT ATG TCT TTA GGT 



T I A I P 
ACC ATT GCT ATA CCT 

700 

V M P V S 
GTA ATG CCT GTT TCT 



Y I C G D 
TAC ATC TGC GGA GAT 

730 

Y G S F C 
TAT GGT AGC TTT TGC 



A A E Q D 
GCT GCT GAA CAG GAT 

760 

K Q M Y K 
AAA CAA ATG TAC AAA 



N F S Q I 
AAT TTT TCA CAA ATA 

790 

S F I E D 
TCT TTT ATT GAG GAC 



A G F M K 
GCT GGC TTC ATG AAG 

820 

A R D L I 
GCT AGG GAC CTC ATT 



L P P L L 
TTG CCA CCT CTG CTC 

850 

A L V S G 
GCT CTA GTT AGT GGT 



G A A L Q 
GGC GCT GCT CTT CAA 



870 



880 
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IPFAMQMAYRFNGIG 
ATA CCT TTT GCT ATG CAA ATG GCA TAT AGG TTC AAT GGC ATT GGA 

890 

VTQNVLYENQKQIAN 
GTT ACC CAA AAT GTT CTC TAT GAG AAC CAA AAA CAA ATC GCC AAC 

900 910 
QFNKAISQIQESLTT 
CAA TTT AAC AAG GCG ATT AGT CAA ATT CAA GAA TCA CTT ACA ACA 

920 

TSTALGKLQDVVNQN 
ACA TCA ACT GCA TTG GGC AAG CTG CAA GAC GTT GTT AAC CAG AAT 

930 940 
AQALNTLVKQLSSNF 
GCT CAA GCA TTA AAC ACA CTT GTT AAA CAA CTT AGC TCT AAT TTT 

950 

GAIS SVLNDILSRLD 
GGT GCA ATT TCA AGT GTG CTA AAT GAT ATC CTT TCG CGA CTT GAT 

960 970 
KVEAEVQ IDRL ITGR 
AAA GTC GAG GCG GAG GTA CAA ATT GAC AGG TTA ATT ACA GGC AGA 

980 

LQSLQTYVTQQLIRA 
CTT CAA AGC CTT CAA ACC TAT GTA ACA CAA CAA CTA ATC AGG GCT 

990 1000 
AEIRASAN'LAATKMS 
GCT GAA ATC AGG GCT TCT GCT AAT CTT GCT GCT ACT AAA ATG TCT 

1010 

ECVLGQS KRVDFCGK 
GAG TGT GTT CTT GGA CAA TCA AAA AGA GTT GAC TTT TGT GGA AAG 

1020 1030 
GYHLMSF PQAAPHGV 
GGC TAC CAC CTT ATG TCC TTC CCA CAA GCA GCC CCG CAT GGT GTT 

1040 

VFLHVTYVPSQERNF 
GTC TTC CTA CAT GTC ACG TAT GTG CCA TCC CAG GAG AGG AAC TTC 

1050 1060 
TTAPAI CHEGKAYFP 
ACC ACA GCG CCA GCA ATT TGT CAT GAA GGC AAA GCA TAC TTC CCT 

1070 

REGVFVFNGTSWF IT 
CGT GAA GGT GTT TTT GTG TTT AAT GGC ACT TCT TGG TTT ATT ACA 

1080 1090 
QRNFFS PQI ITTDNT 
CAG AGG AAC TTC TTT TCT CCA CAA ATA ATT ACT ACA GAC AAT ACA 

1100 

FVSGNCDVVIGIINN 
TTT GTC TCA GGA AAT TGT GAT GTC GTT ATT GGC ATC ATT AAC AAC 
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1110 1120 
TVYDPLQPELDSFKE 
ACA GTT TAT GAT CCT CTG CAA CCT GAG CTT GAC TCA TTC AAA GAA 

1130 

ELDKYFKNHTSPDVD 
GAG CTG GAC AAG TAC TTC AAA AAT CAT ACA TCA CCA GAT GTT GAT 

1140 1150 
FGDISGINASVVNIQ 
TTT GGC GAC ATT TCA GGC ATT AAC GCT TCT GTC GTC AAC ATT CAA 

1160 

KEIDRLNEVAKNLNE 
AAA GAA ATT GAC CGC CTC AAT GAG GTC GCT AAA AAT TTA AAT GAA 

1170 1180 
SLIDLQELGKYEQYI 
TCA CTC ATT GAC CTT CAA GAA TTG GGA AAA TAT GAG CAA TAT ATT 

1183 

K W P OC SEQ ID NO: 9801 

AAA TGG CCT TAA TGAGTCGAC SEQ ID NOs 9802 

Translated Mol . Weight = 131315.20' 
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FIGURE 111 
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FIGURE 113 

5' 3' Frame 1 

PKDMTYVDS SL - WVS - ITKSMVTLICL S PAKKLFVTFVRGLALM- RAVMQLEMLWVLTYL 
SS-DFLQVLT — LYRLVMLTLKITQNSPELMHKPPPVSSLNILYHSCIKACPGM-CVLR- 
YKCSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDLKERWCVTNVQLAFLLHQILMPA 
GI ILWVLTMSITHL-LMFSSGGFTGNLSE- P- PTLPGTWKCTCGLWML 



5 '3' Frame 2 

QRT-PT-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PT 
SPARIFYRC-LSSCTDWLC-H-K-HKIHQS-CTNLHQ-AV-TSYTTHV-RLALECSAY-D 
STNAQ-YTERIWQSRVRPLGAWL-AYINEVLCQDWT-KNVLSV-QTCNLLFYFIRYLCL 
LESFCGF-LCL-PIYD-CSAVGALRVTFQSNHDQHCQVHGNAHVG-L-C 

5 '3' Frame 3 

KGHDLRRLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 
LQLGFSTGWLVAVPTGWDTENNTKFTRWAQTSTSEQFKHLIPLMYKGLPW1WVRIKI 
VQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 
WNHSVGFDYVYNPFMIDVQQWGLYG- PFRVTMTNIARYMEMHMWASCDA 

3" 5' Frame 1 

-HHN-PTCAFPCTWQCWSWLL-KVTRKAPTAEHQS-MGYRHSQNPQNDSSRHKYLMK-KS 
KLHVCHTDNTFFQVQS -QSTSLM-AQSHAPKGRTRLCLTILSVYH-AFVLS -YALHSRAS 
LYT-W-DV-TAHWWRFVH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-L 
HDSPLHQSQSTHERDE-LLRG — TY-GNH-LGNSF-NPS-R-VYVGHVLW 

3' 5* Frame 2 

S ITTS PHVHFHVPGNVGHGYSERLPVKPPLLNINHKWVIDIVKTHRMI PAGI S I — SRKA 
SCTFVTQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QSFQCITEHLYYLNTHYIPGQA 
FIHEWYKMFKLLTGGGLCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSC 
MTALYIKANPRTWTNSFFAGDKHIRVTIDLVIHFETHHRDEST-VMSF 

3 '5' Frame 3 

ASQLAHMCISMYLAMLVMVTLKGYP-SPHC-TSIINGL-T-SKPTE-FQQA-VSDEVEKQ 
VARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLSICTILIRTTFQGKP 
LYMSGIRCLNCSLVEVCALTLVNFVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA 
-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLRRSCPL 
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FIGURE 114 

5' 3' Frame 1 

YRRLISmGFKlOTQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLG 
FSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

5" 3' Frame 2 

TVDS SL - WVSK- ITKSMVTLICLS PAKKLFVTFVRGLALM-RAVMQLEMLWVLTYLSS -D 
FLQVLT — LYRLVMLTLKITQNSPELMQNLHQVTSLNILY 

5' 3' Frame 3 

P-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PTSPARI 
FYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYT 

3 '5' Frame 1 

GIRCLNWSPGGGFALTLVNSVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA-QPS , 
TSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR 

3 '5' Frame 2 

V-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-LHDSPL 
HQSQSTHERDE-LLRG— TY-GNH-LGNSF-NPS-R-VYG 

3*5' Frame 3 

YKMFKLVTWWRFCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSCMTALY 
IKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTV 
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FIGURE 119 
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FIGURE 121 
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SEQ ID NO: 10033 TACCGTAGACTCATCTCTATGATGGGTTTCAAAA 

SEQ ID NO: 10084 CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAA 

Consensus TACCGTAGACTCATCTCTATGATGGGTTTCAAAA 

Prim, cons . CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAZ^A 
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SEQ ID NO: 10033 TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 

SEQ ID NO: 10084 TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 

Consensus TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 

Prim, cons . TGAATTACC AAGTC AATGGTTACC CTAATATGTTTATC AC C C GC GAAGAAGCTATTC GTC 
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SEQ ID NO: 10033 ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 

SEQ ID NO: 10084 ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 

Consensus ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 

Prim. cons. . ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 
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SEQ ID NO: 10033 GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 

SEQ ID NO: 10084 GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
Consensus GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
Prim, cons . GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
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SEQ ID NO: 10033 CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 

SEQ ID NO: 10084 CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 

Consensus CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 

Prim, cons . CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 



310 320 330 340 350 360 

I I I I I I 
SEQ ID NO: 10033 CAGGTGACCAGTTTAAACATCTTATACC 

SEQ ID NO: 100-84 CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG 

Cons ensus CAGGTGACCAGTTTAAACATCTTATACC 

Prim, cons . CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG 



etc. 
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FIGURE 122 

5' 3" Frame 1 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaa 

PRHTQRT-PTVDSSL.-WVSK 
tgaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtc 

-ITKSMVTLICLSPAKKLFV 
acgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgg 

TFVRGLALM-RAVMQLEMLW 
gtactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccga 

VLTYLSS-DFLQVLT - - LYR 
ctggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccac 

LVMLTLKITQNSPELMQNLH 
caggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtag 

QVTSLNIL YHSCIKACPGM- 
tgcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcg 

CVLR-YKCSVIH-KDCQTES 
tgttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattg 

C S S FGRMAL S LHQ - S TL S RL 
gacctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcag 

DLKERVVCVTNVQLAFLLHQ 
atacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatga 

ILMPAGIILWVLTMSITHL - 
ttgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcc 

LMF S S GALRVT FRVTMTN I A 
aggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcag 

R YMEMHMWLVVML S - LD V - Q 
tccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatg 

SMSALLSALIGL LNTLL-EM 
aactgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgc 

N - G L I L LAE KYN TWL SLHC 
ttgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgc 

LLI SFQFFMT-EIQRLSSVC 
ctcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttaca 

LRLK-NGS STMLSHVVTKLT 
aaatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgttt 

K-RNSSILMLYITINSLMVF 
gtttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttg 

VCFGIVTLIVTQPMQLCVGL 
acacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtga 

TQESCQT-TYQAVMVVVCM- 
ataagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaat 

ISMHSTLQLSIKVHLLI-SN 
tgcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgg 

CLSFTILIVLVSLMANK-CR 
atattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtg 

ILIMFHSNLLRVLHDAI-VV 
ctgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatga 

LFADTMQMSTDSTWMHII - - 
tttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaata 

FLLDLAYGFTNNLILITCGI 
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catttaccaggttacagagttta 
H L P G Y R V 



5 1 3 1 Frame 2 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaat 

LGIPKGHDLP-THLYDGFQN 
gaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtca 

ELPSQWLP-YVYHPRRSYSS 
cgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtggg 

RSCVDWL-CRGLS CN-RCCG 
tactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgac 

Y-PTSPARIFYRC-LSSCTD 
tggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacc 

WLC-H-K-HRIHQS-CKTST 
aggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagt 

R-PV-TSYTTHV-RLALECS 
gcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgt 

AY-DSTNAQ-YTERIVRQSR 
gttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgg 

VRPLGAWL -AYI NEVLCQDW 
acctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcaga 

T-KNVLSV-QTCNLLFYFIR 
tacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgat 

YLCLLESFCGF-LCL-PIYD 
tgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcca 

-CS AVGLYG-PSE-P-PTLP 
ggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagt 

GTW l KCTCG-L C YHD-MFSS 

ccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatga 

P-VLC-AR-LVC-IPYYRR- 
actgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgct 

TEG-FCLQKSTTHGCEVCIA 
tgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcc 

C--VSSSS-HRKSKGYQVCA 
tcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaa 

SG-SRMEVLRCSAM - - QSLQ 
aatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttg 

NRGTLLFLCYTSR-IH-WCL 
tttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttga 

FVLEL - R - SL P S Q C N C V - V - 
cacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaa 

HKSLVKLELTRL - WW-FVCE 
taagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaatt 

-ACIPHSSFR-KCIY-FKAI 
gcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgga 

AFLLLF--SL-VSWQTSSVG 
tattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgc 

Y-LCSTQICYVYYTMQFRWC 
tgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgat 
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CLQTPCK-VPTVLGCI-YDD 
ttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaatac 

FCWI-PMDLQTI-YL-PVEY 
atttaccaggttacagagttta 

I Y Q V T E F 



5 ■ 3 ■ Frame 3 

cctaggcatacccaaaggacATGacctaccgtagactcatctctatgatgggtttcaaaatg 

-AYPKDMTYRRL I SMMGFKM 
aattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtcac 

NYQVNGYPNMFITREEAIRH 
gttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgggt 

VRAWIGFDVEGCHATRDAVG 
actaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgact 

T NLPLQLGFSTGVNLVAVPT 
ggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacca 

GYVDTENNTEFTRVNAKPPP 
ggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagtg 

GDQFKHLIPLMYKGLPWNVV 
cgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgtg 

RIKIVQML S'DTLKGLSDRVV 
ttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgga 

fvlwahg'feltsmkyfvkig 

cctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttc tact teat cagat 

PERTCC LCDK RA TCFSTSSD 
acttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgatt 

TYACWNHSVG FDYVYNPFMI 
gatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgccag 

DVQQWGFTGN LQSNHDQHCQ 
gtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagtc 

VHGNAHVASCDAIMTRCLAV 
catgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatgaa 

HECFVKRVDWSV EYPIIGDE 
ctgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgctt 

L RVN S A C RKVQ HMVVK SAL L 
gctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcct 

ADKFPVLHDIGNPKAIKCVP 
caggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaaa 

QAEVEWKFYDAQPC SDKAYK 
atagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttgt 

IEELFYSYAIHHDKFTDGVC 
ttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttgac 

LFWNCNVDRYPANAI VCRFD 
acaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaat 

TRVLSNLNLPGCDGGSLYVN 
aagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaattg 

KHAFHTPAFDKSAFTNLKQL 
cctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcggat 

PFFYYSDSPCESHGKQVVSD 
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attgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgct 

IDYVPLKSATCITRCNLGGA 
gtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgatt 

VCRHHANEYRQYLDAYNMMI 
tctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaataca 

SAGFSLWIYKQFDTYNLWNT 
tttaccaggttacagagttta 

F T R L Q S L 



3 1 5 1 Frame 1 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaat 

-TL-PGKCIPQVISIKLFVN 
ccataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactc 

P-AKSSRNHH I ICIQVLSVL 
atttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcaga 

ICM VSANSTT- IASCNTRSR 
tttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggact 

FEWNI INIRHYLFAMRLTRT 
atcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagc 

IRIVKERQLL-ISKCTFIES 
tggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagtt 

WSVECMLIHIQTTTITAW-V 
caagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatc 

QV - * Q D S CVKPTHNC I GWVT I 
aacgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcata 

NVT I PK QTNTI SEF IVMYS I 
agaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaa 

RIEEFLYFVSFVTTWLSIVE 
cttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatg 

LPFYFSLRHTLDSLWISYVM 
aagaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttct 

KNWKL I SKQCRLHNHVLYFS 
gcaagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaac 

ASR.INPQFISYNRVFNRPIN 
gcgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagc 

ALNKAL. MDC-TSSHDSITTS 
cacatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgt 

HMC I SMYLAMLVMVTLKVTR 
aaagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacaga 

KAPLLNINHKWVIDIVKTHR 
atgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcaca 

MIPAGISI--SRKASCTFVT 
cagacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaa 

QTTRSFRSNLDKVLH-CKLK 
gccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgag 

AMRPKDEHDSV-QSFQCITE 
catttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtat 

HLYYLNTHYIPGQAFIHEWY 
aagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtt 
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KMFKLVTWWRFC INSGEFCV 
attttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcc 

IFSVNITSRYSY-VNTCRKS 
tagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatc 

-LER-VSTHSI SSCMTALYI 
aaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagg 

KANPRTNVTNSFFAGDKHIR 
gtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggt 

VTIDLVIHFETHHRDESTVG 
catgtcctttgggtatgcctagg 

H V L W V C L 



3 ■ 5 ■ Frame 2 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatc 

KLCNLVNVFHRL-VSNCL-I 
cataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactca 

HRLNPAEI IILYASKYCRYS 
tttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagat 

FAWCLQTAPPKLHRVIHVAD 
ttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggacta 

L S G T - S I S D T T C L P - D S Q G L 
tcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagct 

S E - - KKGNCFKLVNALLSKA 
ggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttc 

GVWNACLFTYKLPPSQPGKF 
aagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatca 

KFDKTLVSNLHTIALAG-RS 
acgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataa 

TLQFQNKQTPSVNLS-CIA- 
gaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaac 

E-KSSSIL-ALSLHG-AS-N 
ttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatga 

FHSTSA-GTHLIAFGFPMS- 
agaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctg 

RTGNLSASNADFTTMCCTFL 
caagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacg ' 

QAELTLSSSPIIGYST DQST 
cgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcc 

RLTKHSWTAKHLVMIASQLA 
acatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgta 

TCAFPCTWQCWSWLL-RLPV 
aagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaa 

KPHC-TSIINGL-T-SKPTE 
tgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacac 

-FQQA-VSDEVEKQVARLSH 
agacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaag 

RQHVLSGPILTKYFIDVSSK 
ccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagc 

PCAQRTNTTLSDNPFSVSLS 
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atttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtata 

ICTILIRTTFQGKPLYMSGI 
agatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtta 

RCLNWS PGGGFALTLVNSVL 
ttttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcct 

F S V S T - PVGTATKLTPVENP 
agctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatca 

SWRGR LVPTASLVA-QPSTS 
aagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattaggg 

KPIHART-RIASSRVINILG 
taaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtc 

-PLTW-FILKPIIEMSLR-V 
atgtcctttgggtatgcctagg 

M S F G Y A - 



3 '5' Frame 3 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatcc 

NSV TW-MYSTGYKYQIVCKS 
ataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactcat 

I G - I QQKSSYYM HPSTVGTH 
ttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagatt 

LHGV CKQHHLNC I V - Y T - Q I 
tgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggactat 

-VEHNQYPTLLVCHETHKDY 
cagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagctg 

Q N S K R K A I A L N -MHFYRKL 
gagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttca 

ECGMHAYSHTNYHHHSLVSS 
agtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatGaa 

SLTRLLCQTYTQLHWLGNDQ 
cgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataag 

RYNSKTNKHHQ - IYRDV-HK 
aatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaact 

NRRVPLFCKLCHYMAEHRRT 
tccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatgaa 

SILLQPEAHT- -PLDFLCHE 
gaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctgc 

ELETYQQAMQTSQPCVVLFC 
aagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacgc 

KQN-PS VHLL - - GIQQTNQR 
gcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcca 

A-QSTHGLLNI-S--HHN-P 
catgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgtaa 

HVHFHVPGNVGHGYSEGYP- 
agccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaat 

SPTAEHQS -MGYRHSQNPQN 
gattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacaca 

DSSRHKYLMK-KSKLHVCHT 
gacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaagc 
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DNTFFQVQS-Q STSLM-AQS 
catgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagca 

HAPKGRTRLCLTILSVYH-A 
tttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtataa 

FVLS-YALHSRASLYT-VV- 
gatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgttat 

DV-TGHLVEVLH-LW - ILCY 
tttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatccta 

FQCQHNQSVQLLS-HL-KIL 
gctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatcaa 

AGEVG-YPQHL - LHDSPLHQ 
agccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagggt 

SQSTHERDE-LLRG--TY-G 
aaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtca 

NH-LGNSF-NPS-R-VYGRS 
tgtcctttgggtatgcctagg 

C P L G M P R 
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FIGURE 123 



CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAATGMTTACCMGTCMTGGT 

i N..N i N. . . . 

TACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTCACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCA 

i N N 

TGCAACTAGAGATGCTGTGGGTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 

N 

CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCACCAGGTGACCAGTTTAAACAT 

N N 

CTTATACCACTCATGTATAAAGGCTTGCCCTGGMTGTAGTGCGTATTAAGATAGTACAAATGCTCAGTGATACACTGAA 

N N N 

AGGATTGTCAGACAGAGTCGTGTTCGTCCTTTGGGCGCATGGCTTTGAGCTTACATCAATGAAGTACTTTGTCAAGATTG 

N N 

GACCTGAAAGAACGTGTTGTCTGTGTGACAAACGTGCAACTTGCTTTTCTACTTCATCAGATACTTATGCCTGCTGGAAT 

N 

CATTCTGTGGGTTTTGACTATGTCTATAACCCATTTATGATTGATGTTCAGCAGTGGGGCTTTACGGGTAACCTTCAGAG 

N N N 

TAACCATGACCAACATTGCCAGGTACATGGAAATGCACATGTGGCTAGTTGTGATGCTATCATGACTAGATGTTTAGCAG 

N N N N N i N 

TCCATGAGTGCTTTGTTAAGCGCGTTGATTGGTCTGTTGAATACCCTATTATAGGAGATGAACTGAGGGTTAATTCTGCT 

...N N 

TGCAGAAAAGTACAACACATGGTTGTGAAGTCTGCATTGCTTGCTGATAAGTTTCCAGTTCTTCATGACATAGGAAATCC 

i N. 

AAAGGCTATCAAGTGTGTGCCTCAGGCTGAAGTAGAATGGAAGTTCTACGATGCTCAGCCATGTAGTGACAAAGCTTACA 

N N N 

AAATAGAGGAACTCTTCTATTCTTATGCTATACATCACGATAAATTCACTGATGGTGTTTGTTTGTTTTGGAATTGTAAC 

N N 

GTTGATCGTTACCCAGCCAATGCAATTGTGTGTAGGTTTGACACAAGAGTCTTGTCAAACTTGAACTTACCAGGCTGTGA 

N ....N 

TGGTGGTAGTTTGTATGTGAATAAGCATGCATTCCACACTCCAGCTTTCGATAAAAGTGCATTTACTAATTTAAAGCAAT 

N N 

TGCCTTTCTTTTACTATTCTGATAGTCCTTGTGAGTCTCATGGCAAACAAGTAGTGTCGGATATTGATTATGTTCCACTC 

i N 

AAATCTGCTACGTGTATTACACGATGCAATTTAGGTGGTGCTGTTTGCAGACACCATGCAAATGAGTACCGACAGTACTT 

N N N 

GGATGCATATAATATGATGATTTCTGCTGGATTTAGCCTATGGATTTACAAACAATTTGATACTTATAACCTGTGGAATA 

. .N N. .N N 

CATTTACCAGGTTACAGAGTTTA SEQ ID NO: 10084 
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FIGURE 123 (contd.) 
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FIGURE 124 



Sequences : 



74827 |pir | |VFIHJH genome polyprotein lb - murine hepatit. 
14917044|sp|P29982|RRPB_CVMJH RNA-directed RNA polymeras . 
26007546 jref |NP_068668. 2 | ORFlab polyprotein [Murine hep. 



7769342 
6625761 
2641128 
4377413 



gb 
gb 

gb 



AAF69332.1 
AAF19384 .1 
AAB86818.1 



emb CAA36202 



AF208066_2 RNA-directed RNA polyme. 
AF201929_2 RNA-directed RNA polyme. 

RNA-directed RNA polymerase [muri. 
. I open reading frame lb <AA 1-2733. 



133592 sp P16342 RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE 



26008080 
15077820 
18033972 



ref |NP_150073 .2 | orflab polyprotein [Bovine cor.. 



gb 
gb 



AAK83365.1 
AAL57305.1 



replicase [bovine coronavirus] 
replicase [bovine coronavirus] 



ref 
ref 
ref 
emb 



NP_740 618.1 
NP_742140.1 
NP_066134.1 
CAC39112.ll 



7769353 |gb|AAF69342 .l|AF208067_2 RNA-directed RNA polyme. 

gb|AAL40397.l|AF220295_2 RNA polymerase lb [bov. 

coronavirus nspll [Murine hepa. 
coronavirus nspll [Bovine coro . 
ORFlab polyprotein; frameshift. 
replicase polyprotein lab [Avia. 
emb| CAA83018 . 1 | potential chimeric protein [Avian. 
sp]P26314 |RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (. 
gb|AAO67706.l| ORFlb polyprotein [Avian infecti . 
ref |NP_740631 . 1 | coronavirus nspll [Avian infec. 

replicase [Transmissible gastro. 
Poll [porcine epidemic diarrhe. 
replicase polyprotein lab [Hum. 
13359l|sp|Pl8458|RRPB_BEV RNA-directed RNA polymerase (0. 
1513061 | dbj |BAA13323.1| cyanoprotein alpha subunit precu. 



17529672 
25121571 
26008092 
10242469 
14149033 
458735 
133594 
29293454 
25121555 
9635157 | ref |NP_058422 . 1 | 



19387582 
12175747 



ref 
ref 



NP_598309.1 
NP_073549.1 



(bits) 


Value 


638 


0.0 


637 


0.0 


637 


0.0 


637 


0.0 


637 


0.0 
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0.0 
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0.0 
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e-180 
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e-180 
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e-163 


575 


e-163 
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e-160 
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e-153 


541 
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535 


e-151 


50 


8e-05 


35 
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Alignments 



>gi|74827|pir| |VFIHJH genome polyprotein lb - murine hepatitis virus 
(strain JHM) 

Length =2731 

Score = 638 bits (1645), Expect = 0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MT YRRL I SMMGFKJynSTY'QVNG YP3SJMF I TREE AI RHVRAWI GFDVEGCHATRDAVGTNL PLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1644 

Query: 66 LGFSTGWLVAVPTGYVDTENNTEFTRW 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 

Sbjct: 1645 LGFSTGIDFVVEATGMFAERDGYWKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRVWVLWAHGFELTSMKYFWIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSWLVTWAASFELTCLRYFAKVGKEWCSVCNKRATCFNSRTGYYGCWR 17 64 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFW 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYTSFPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 
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Query: 246 RVDWS VE Y P 1 1 GDELRVNS AC RKVQHMVVK S ALL ADKF P VLHD I GNPKAI KC VP QAE VEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPI I SNEVS WTSCRLLQRWFRAAMLCNRYDVCYDIGNPKGLACVKG — YDF 1882 

Query : 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNC3STVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVLS 

Sbjct: 1883 KFYDASPV VKSWQFWKYEAHKDQFLDGLCMFWNCNVDKYPANAWCRFDTRVLSK 1939 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQW 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC -J-DYVPL 
Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFEN^ 1999 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAY1SIMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi 

gi 



14917044 |sp|P29982 |RRPB_CVMJH RNA-directed RNA polymerase (ORFlB) 
758332l|gb|AAA46458.2 | open reading frame lb [murine hepatitis virus] 
Length = 2731 



Score = 637 bits (1644), Expect = 0.0 

Identities = 287/481 (59%), Positives ='366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFK3YDKTYQWGYPN 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VT YSRL I S LMGFKLDLTLDGYCKLF I TRDE AI KRVRAWVGFDAEG AHATRDS I GTNF PLQ 1644 

Query: 66 LGFSTGWLVAVPTGYVDTENNTEF^ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 

Sbjct: 1645 LGFSTGIDFWEATGMFAERDGYWKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEWCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYTSTPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFW 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPK^ 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPI I SNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1882 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 3 65 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVLS 

Sbjct: 1883 KFYDASPV VKSVKQFWKYEAHKDQFLDGLCMFWNCOT 1939 

Query: 3 66 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFENLKPM 1999 
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Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNiyDytlSAGFSLWIYKQFDTYNLWNTFTRLQS 485 

+ S ATC ITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi | 26007546 | ref |NP__068668 . 2 | ORFlab polyprotein [Murine hepatitis virus] 
Length = 7178 

Score = 637 bits (1644), Expect =0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKJYDNnf^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sb j c t : 6032 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 6091 

Query: 66 LGFSTGWLVAVPTGYVDTE1OT 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 

Sbjct: 6092 LGFSTGIDFWEATGMFAERDGYWKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 6151 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 6152 MLSDHLADLADSVVLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 6211 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQWGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 6212 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 6271 

Query : 246 RVDWS VEYPI IGDELRVNS ACRKVQHMWKSALLADKFPVLHDIGNPKAIKC VPQAEVEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK 4- CV ++ 
Sbjct: 6272 SVRIWNLEYPIISNEVSWTSC 6329 

Query : 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFTOJCNVDRYPANAI VCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNC3STVD+YPANA+VCRFDTRVL+ 

Sbjct: 6330 KFYDASFV-— VKSVKQFV^^ 6386 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 6387 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPM 6446 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWI^ 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 6447 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWWKTFDFYNLWNTFTRLQS 6506 

Query: 486 L 486 
L 

Sbjct: 6507 L 6507 

>gi| 7769342 |gb|AAF69332.l|AF208066_2 RNA-directed RNA polymerase [murine 
hepatitis virus] 

Length = 2732 
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Score = 637 bits (1644), Expect =0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISmGFKMJSTY'QWGY 65 

+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1586 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSIGTNFPLQ 1645 

Query: 66 LGFSTGWLVAVPTGYVDTEISOT 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 

Sbjct: 1646 LGFSTGIDFWEATGMFAERDGYVFKKAVARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1705 

Query: 126 MLSDTLKGLSDRVWVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1706 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEWCSVC3STKRATCFNSRTGYYGCWR 1765 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIM 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1766 HSYSCDYLYNPLIVDIQQWGYTGSLTS3SFHDLICSVHKGAHVASSDAIMTRCLAVHDCFCK 1825 

Query : 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1826 SV1SMSLEYPIISNEVSWTSCRLLQRVMFRAAMLCNRYDVCYDI 1883 

Query: 3 06 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 3 65 . 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 1884 KFYDASPV VKSVKQFWKYEAHKDQFLDGLCMFWNCNVDKYPANAWCRFDTRVL3NFK 1940 

Query: 3 66 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC ■ +DYVPL 

Sbj ct : 1941 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2000 

Query: 426 'KSATCITRCNLGGAVCRHHANEYRQYL^^ 485 

+ S ATC I TRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2001 RSATCITRCNLGGAVCLKHAEDYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2060 

Query: 486 L 486 
L 

Sbjct: 2061 L 2061 



>gi| 662576ljgb|AAF19384.l|AF201929_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain 2] 

gi|7739595]gb|AAF68920.l|AF207902_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain ML- 11] 
Length =2733 

Score = 637 bits (1643), Expect =0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYPNMFITREEA 65 

+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSIGTNFPLQ 1646 

Query: 66 LGFSTGWLVAVPTGYVDTENNTEFTRW 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+ PLM +G W+WRI+IVQ 
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Sb j ct : 


1647 


Query : 


126 


Sb j ct : 


1707 


Query: 


186 


Sbjct : 


1767 


Query: 


246 


Sbjct: 


1827 


Query: 


306 


Sbjct: 


1885 


Query: 


366 


Sbjct: 


1942 


Query: 


426 


Sbjct: 


2002 


Query: 


486 


Sbjct: 


2062 



MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 



HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 



KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 



LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 



+ S ATC I TRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 



>gi|2641128|gb|AAB86818.l| RNA-directed RNA polymerase [murine hepatitis 
virus] 

Length =2733 
Score = 635 bits (1637), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

MTYRRL I SMMGFKMNYQ WGYPIMF ITREEAIRHVRAWI GFDVEGCHATRDAVGTNL PLQ 6 5 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 



LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 



MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 



HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



Query : 


6 


Sbjct : 


1587 


Query: 


66 


Sbjct: 


1647 


Query : 


126 


Sbjct: 


1707 


Query: 


186 


Sb j ct : 


1767 



Query : 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
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Sbjct: 


1827 


Query: 


306 


Sb j ct : 


1885 


Query: 


366 


Sb j ct : 


1942 


Query: 


426 


Sbj ct : 


2002 


Query: 


486 


Sbj ct : 


2062 



KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 



LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 



+ S ATC I TRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 



>gi | 4377413 |emb|CAA3 62 02. l| open reading frame lb (AA 1-2733) [Murine 
hepatitis virus] 

Length = 2733 

Score = 634 bits (1636), Expect = 0.0 



MTYRRLISmGFKMSTYQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 



LGFSTGWLVAVPTGYVDTE3MTEFTRWAKPPPGDQFK^^ 125 
LGFSTG++ V TG + F + A+ PPG+ QFKHL I PLM +G W+WRI+IVQ 



MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 



HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 



KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 



LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 



* Identities 


Query : 


6 


Sbjct: 


1587 


Query : 


66 


Sbjct: 


1647 


Query : 


126 


Sbjct: 


1707 


Query : 


186 


Sbjct: 


1767 


Query: 


246 


Sbjct: 


1827 


Query: 


306 


Sbjct: 


1885 


Query: 


366 


Sbj ct : 


1942 



Query: 



426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNmiSAGFSLWIYKQFDTYISnLWNTFTRLQS 485 
+ S ATC I TRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
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Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWWKTFDFYNLWNTFTRLQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi 
gi 

A59 



133592 | sp 
93916|pir 



P16342 |RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE (ORFlB) 
IS15760 genome polyprotein - murine hepatitis virus (strain 



Length = 2733 



Score = 634 bits (1636), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

- MTYRRL I SJytMGFKiynSTZQ WGYPNMF I TREE AI RHVRAWI GFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 



LGFSTGWLVAVPTGYVDTENNTEFTRWAKPPPGDQFKHLIPLMYKGLPV^ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 



MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 



ES DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 



KFYDA P +++ Y Y H D+F DG4-C+FWNCNVD+YPANA+VCRFDTRVL+ 



LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 



+ S ATC I TRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 



Query : 


6 


Sbj ct : 


1587 


Query : 


66 


Sbjct: 


1647 


Query: 


126 


Sbjct: 


1707 


Query: 


186 


Sbjct: 


1767 


Query : 


246 


Sbjct: 


1827 


Query: 


306 


Sbjct: 


1885 


Query: 


366 


Sbjct: 


1942 


Query : 


426 


Sbjct: 


2002 


Query : 


486 


Sbj ct : 


2062 



>gi | 26008080 jref |NP_150073 .2 | orflab polyprotein [Bovine coronavirus] 
Length = 7094 

Score = 633 bits (1633), Expect = e-180 
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Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MT YRRL I SMMGFKMlSnirQWGYPNMF ITP^EAIPJIVRAWI GFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sb j c t : 5948 VTYSRLI SLMGFKLDVTLDGYCKLFITKEEAVKRVRAWGFDAEGAHATRDS IGTNFPLQ 6007 

Query: 66 LGFSTGWLVAVPTGYVDTENNTEFTRWAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 

Sbjct: 6008 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCVVLVTWAAOTELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLS SNHDLYCSVHKGAHVAS SDAIMTRCLAVYDCFCN 6187 

Query: 246 RVT)WSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAAM^ 6245 

Query : 306 KF YDAQPCSDKAYKIEELFYS YAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI — VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAWCRFDTRVLNN 6302 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

' LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSPvAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL 6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYMyMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 

- KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQS 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|l5077820|gb|AAK83365.l| replicase [bovine coronavirus] 
Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMSHfQWGYPM^IT^ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCIOjFITKEEAWRvRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGWLVAVPTGYVIJTENNTEFTRWAKPPPGD^^ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 

Sbjct: 6008 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 6067 

Query: 126 MLSDTLKGLSDRVWvLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCWLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 6127 
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Query: 


186 


Sbjct : 


6128 


Query: 


246 


Sbjct: 


6188 


Query: 


306 


Sbjct: 


6246 


Query: 


366 


Sbjct: 


6303 


Query: 


426 


Sbjct: 


6363 


Query: 


486 


Sbjct: 


6423 



HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 



++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 



KFYDAQP ++ L YS+ H D F DG+C+FTOCNVD+YP NA+VCRFDTRVL+N 



LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 



KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 



L 486 
L 



>gi | 18033972 | gb| AAL573 05 . 1 | replicase [bovine coronavirus] 
Length =7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

MTYRRLISmGFKMNYQWGYPIMFITREEAIRHVRAWIGFDVEGCHAT 6 5 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 



LGFSTGWLVAVPTGYVDTENNTEFTRWAKPPPGDQFK^^ 125 
LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 



MLSDTLKGLSDRVWVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 



HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 



RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 



KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 



Query: 


6 • 


Sbjct: 


5948 


Query: 


66 


Sbjct: 


6008 


Query: 


126 


Sbjct: 


6068 


Query: 


186 


Sb j ct : 


6128 


Query: 


246 


Sbjct: 


6188 


Query: 


306 


Sbjct: 


6246 
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Query: 3 66 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 6303 LNLPGCNGGSLYVISTKHAFHTK 6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLTO 485 

KSATCITRCNLGGAVC HA EYR+YL+4-YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWWKTFDFYNLWNTFTKLQS 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|7769353 | gb | AAF69342 . 1 | AF208067_2 RNA-directed RNA polymerase [murine 
hepatitis virus] 

Length = 2733 

Score = 633 bits (1633), Expect = e-180 

Identities = 285/481 (59%>, Positives = 364/481 (75%), Gaps = 5/481 (1%) 



Query: 6 MTYRRLISMMGFKiynsr^QWGYPNMF 65 

++Y RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VSYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 LGFSTGWLVAVPTGYVDTENOT^ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGWFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSWLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSW 1884 

Query: 3 06 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 1885 KFYDASPV VKSVKQFWKYEAHKDQFLDGLCMFWNCNVDKYPANAWCRFDTRVLNK 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPM 2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNmiSAGFSLWIYKQFDTYNLWNTFTRLQS 485 

+ S ATC I TRCNLGGAVC HA EYR+YL-f+YN , +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2061 



Query : 
Sbjct: 



486 L 486 
L 

2062 L 2062 
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>gi | 17529672 | gb | AAL40397 . 1 | AF220295_2 RNA polymerase lb [bovine 
coronavirus] 

Length = 2685 

Score = 623 bits (1607), Expect = e-177 

Identities = 282/481 (58%), Positives = 365/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLI SMMGFKMNYQVNGYPNMF ITREE AIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1574 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1633 

Query : 6 6 LGFSTGVNLVAVPTGYVDTE3SINTEFTRVNAKPPPGDQFKHLI PLMYKGLPW3WVRIKIVQ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 

Sbjct: 1634 LGFSTGIDFWEATGLFADRDGYSFKKAVAEAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 1693 

Query: 12 6 MLSDTLKGLSDRWFVLWAHGFELTSiyiKYFVKIGPERTCCLCDKRATCFSTSSDTYAC™ 185 

M +D L LSD W V WA FELT ++YF K+G E +C + KRAT +++ + Y CW 
Sbjct: 1694 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVSTKRATAYNSRTGYYGCWR 1753 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAI^ 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 1754 HSVTCDYLYNPLIVDIQQWGYIGSLSS3STHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 1813 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHD 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 1814 NINWWEYPIISNELSINTSCRVL 1871 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365' 

KFYDAQP ++LY+ HDF DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sb j ct : 1872 KFYDAQP I - - -WSVKTLLYFFEAHKDSFKDGLCMFWNCNVDKYPPNAWCRFDTRVLNN 1928 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 1929: LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYY 1988 

Query : 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMI SAGFSLWIYKQFDTYNLWNTFTRLQS 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 1989 KS ATC I TRCNLGGAVCLKHAEE YREYLE S YNTATT AGFTFWVYKTFDF YNLWNTFTKLQS 2048 

Query: 486 L 486 
L 

Sbjct: 2049 L 2049 



>gi | 25121571 | ref |NP_740618 . 1 | coronavirus nspll [Murine hepatitis virus] 
Length =521 

Score = 622 bits (1603), Expect = e-177 

Identities = 284/479 (59%), Positives = 362/479 (75%), Gaps = 5/479 (1%) 

Query: 6 MTYRRLI SI^GFKMRnfQWGYP 65 

+TY RLIS+MGFK++ -J-+GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 107 

Query: 66 LGFSTGVNLVAVPTGYVDTEINnSfTEF 125 
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Sbjct: 


108 


Query : 


126 


Sbjct: 


168 


Query: 


186 


Sbjct: 


228 


Query: 


246 


Sbjct: 


288 


Query: 


306 


Sbjct: 


346 


Query: 


366 


Sb jet : 


403 


Query: 


426 


Sbjct: 


463 



LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 



MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 



HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 



KFYDA P +++ Y Y H D+F DG+C+FWNC1WD+YPANA+VCRFDTRVL+ 



LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 



+ S ATC ITRCNLGGAVC HA EYR+YL+'+YN +AGF+ W+YK FD YNLWNTFTRLQ 



>gi|26008092 |ref |NP_742140.l| coronavirus nspll [Bovine coronavirus] 
Length = 521 

Score = 617 bits (1590), Expect = e-175 

Identities = 282/479 (58%), Positives = 365/479 (76%), Gaps = 5/479 (1%) 



Query: 6 MTYRRLISMMGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCI^ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 107 

Query: 66 LGFSTGWLVAVPTGYVDTENOT 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 

Sbjct: 108 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 167 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 168 MFADHLIDLSDCWLVTWAANFELTCLRYFAKVGREISC3WCTKRATAYNSRTGYYGCWR 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQS3STHDQHCQVHGNAHVASCDAIMTRCLAVHECFW 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 228 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 287 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 288 NINWIWEYPIISNELSINTSCRV^ — KDFDF 345 

Query : 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNC3WDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 
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Sbjct: 346 KFYDAQPI WSWTLLYSFEAHKDSFKDGLCMF^ 402 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 403 LNLPGCNGGSLYVNKHAFHTKPFSR 462 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYK^^ 484 

KS ATC I TRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQ 
Sbjct: 463 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQ 521 



>gi | 10242469 | ref | NP_066134 . 1 1 ORFlab polyprotein; frameshift product [Avian 
infectious bronchitis 
virus] 
Length = 6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGYPN^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 EiTYKHLISLLGFKMSVNVEGC:^ 5574 

Query: 65 QLGFSTGVNLVAVPTGYVDTEl^ 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 5575 QVGFSTGADFWTPEGLVDTSIGNNFEPWSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 5634 

Query: 125 QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QMLADNLCWSDCVWVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 5693 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAIN^ 5753 

Query: 245 KRVDWSVEYPIIGDELRWSACRKVQHMVWSALLADKFPVLHDIGNPKAIKCVPQAEVE 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDV1MDLTYPHIANEDEVNSS 5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFVmCNVDRYPANAIVCRFDTRVLS 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 5814 FRFYDKNPIVRNVKQFE- - - YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 5870 

Query: 3 65 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF-fTP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 5871 WNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-D 5929 

Query: 425 LKSATCITRCNLGGAVCRHHAJNTEYRQYL^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKDCITKCNIGGAVCKKHAQMYAEFVT^ 5989 



Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 
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>gi| 14149033 |emb|CAC3 9112.1) replicase polyproteiri lab [Avian infectious 
bronchitis virus (strain 
Beaudette CK) ] 
Length = 6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRL I SMMGFKMNYQVNGYPNMF I TREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIR^ 5574 

Query: 65 QLGFSTGVBTkVAVPTGYTO^ 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 5575 QVGFSTGADFWTPEGLVDTSIGNNFEPWSKAPPGEQFNHLRVLFKSAKPW 5634 

Query: 125 QMLSDTLKGLSDRVWVLWAHGFELTS^YFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QMLADNLCNVS DC WF VTWCHGLELTTLRYFVKI GKEQVC S - C G SRATTFNS HTQAYAC W 5693 

Query: 185 NHSVGFDYWNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA+ + F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAH^ 5753 

Query: 245 KRVDWSVEYPIIGDELRWSACRKVQHM 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNWDLTYPHIANEDEWSSCRY 5813 

Query : 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 5814 FRFYDKISTPIVRNVKQFE YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 5870 

Query: 365 ■ NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 5929 

Query: 425 LKSATCITRCNLGGAVCRHHANEY 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWT3SKLNPYNLWKSFSALQ 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 

>gi|458735|einb|CAA83018.l| potential chimeric protein [Avian infectious 
bronchitis virus] 

Length = 2155 

Score = 570 bits (1470), Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKM^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1596 EITYKHLISLLGFKMSVWEGCHNMFITRDEAIRN^ 1655 
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Query: 65 QLGFSTGVNLVAVPTGYVDTEIMTEFTRWAKPPPGDQFra 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1656 QVGFSTGADFWTPEGLVDTSIGl^FEP 1715 

Query: 125 QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1716 QMLADISHjC WSDCVVF VTWCHGLELTTLRYF VKIGKEQVCS - CGSRATTFNSHTQAYACW 1774 

Query : 185 NHS VGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECF V 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VH6+AHVAS DAIMTRCLA+ + F 
Sbjct: 1775 KHCLGFDFVYNPLLVDIQQWGYSGNLQF3SFHDLHC1WHGHAHVASVDAIMTRC 1834 

Query: 245 KRVDWS VE YP 1 1 GDELRVNS ACRKVQHMWKS ALLADKF PVLHD I GNPKAI KCVPQAEVE 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1835 QDVNWDLT YPHI ANEDE WS SC GNPKGIKCVRRGDVN 1894 

Query : 3 05 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAI VCRFDTRVLS 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1895 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWNC3STVDCYPDNSLVCRYDTRNLS 1951 

Query : 365 NLNLPGCDGGSLYV]SIKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 1952 WNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 2010 

Query: 425 LKSATCITRCNLGGAVCRHHAN 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 2011 LATIODCITKCNIGG 2070 

Query: 485 SL 486 
S+ 

Sbjct: 2071 SI 2072 



>gi|l33594|sp 
gi | 74826 |pir 
virus (strain 

Beaudette) 



P26314 |RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (ORFlB) 
|VFIHB2 genome polyprotein - avian infectious bronchitis 



gi 



292953 gb|AAA70234.1 
331173 gb|AAA46224.1 
Length = 2652 



pol protein [Avian infectious bronchitis virus] 
ORFlb [Avian infectious bronchitis virus] 



Score = 570 bits (1469), Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMD^QWGYPNMF 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWGFDVEAT 1597 

Query: 65 QLGFSTGWLVAVPTGYVDTE3MTEFTRWAKPPPGDQFKHLIPLMYKGLPW1WW 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1598 QVGFSTGADFWTPEGLVDTSIG3MFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 1657 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCNVSDCWF VTWCHGLELTTLRYFVKIGKEQVCS - CGSRATTFNSHTQAYACW 1716 
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Query: 185 NHSVGFDYVYNPFMIDVQQW 244 

H +GFD+VYNP ++D+QQWG++GNLQ 3SFHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 1717 KHCLGFDFVYNPLLVDIQQWGYSGNL^ 1776 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMWKSALLAD^ 304 

+ V+W + YP I +E VNS+CR +Q M + + 4- A K V++DIGNPK IKCV + +V 
Sbjct: 1777 QDVIMDLTYPHIANE^ 1836 

Query : 305 WKFYDAQPCSDKAYKIEELFYS YAIHHDKFTDGVCLFWNCNVDRYPANAI VCRFDTRVLS 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1837 FRF YDKNP I VRNVKQFE YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 1893 

Query: 365 NLNLPGCDGGSLYVISIIQI^FHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDI 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 1894 WNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-D 1952 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYISIMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 1953 LATKDCITKCNIGGAVCKKHAQMYA 2012 

Query: 485 SL 486 
S+ 

Sbjct: 2013 SI 2014 , 



>gi|29293454|gb|AAO67706.l| ORFlb polyprotein [Avian infectious bronchitis 
virus] 

Length = 2 649 
Score = 565 bits (1455), Expect = e-160 

Identities = 261/482 (54%), Positives = 342/482 (70%), Gaps = 8/482 (1%,) 

Query: 5 DMTYRRLISMMGFKMKTC 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLLGFKMSVWEGCHINM^ 1597 

Query: 65 QLGFSTGWLVAVPTGYVDTENOT 124 

Q+GFSTG + V P G +DT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1598 QVGFSTGADFWTPEGLIDTSIGNNFEPWSK^ 1657 

Query: 125 QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCWSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 1716 

Query: 185 3STHSVGFDYVYNPFMIDVQQWGFTGNLQS3STHDQHCQVHGNAHVASCDAIMTR 244 

H +G VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DA+MTRCLA+ + F 
Sbjct: 1717 RHCLG VYNPLLVDIQQWGYSGNLQFNHDLHCNV^ 1773 

Query: 245 KRVDWSVEYPIIGDELRWSACRKVQHM 304 

K V+W ++YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1774 KDWWELQYPHIANEDEWSSCRYLQRM^ 1833 

Query : 305 WKF YDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 

++FYD P + E Y Y+ H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1834 FRFYDKNPIVPNVKQFE YDYSQHKDKFADGLCiyiFWNCNVDCYPENSLVCRYDTRNLS 1890 
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Query: 3 65 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAFHTP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 1891 WNLPGCNGGSLYVISKHAFHTPK^ 1949 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMI^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ F+ YNLW F+ LQ 
Sbjct: 1950 LATKDCITKCNIGGAVCKKHAQMYAEFWSYN^ 2009 

Query: 485 SL 486 
S+ 

Sbjct: 2010 SI 2011 

>gi | 25121555 |ref |NP_740631 . 1 | coronavirus nspll [Avian infectious 
bronchitis virus] 

Length = 521 

Score = 559 bits (1440), Expect = e-158 

Identities •= 261/480 (54%), Positives = 342/480 (71%), Gaps = 5/480 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGY 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 47 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNW^ 106 

Query: 65 QLGFSTGWLVAVPTGYVDTEW 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 107 QVGFSTGADFWTPEGLVDTSIGNNFEPWSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 166 

Query: 125 QMLSDTLKGLSDRVWVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 167 QMLADNLCIWSDCWFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 225 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVH^ 244 : 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC YHG+AHVAS DAIMTRCLA++ F 
Sbjct: 226 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHD 285 

Query: 245 KRVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 286 QDVNWDLTYPHIANEDEWSSCRYL^ 345 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFVOTCNVDRYPANAIVCRFDTRVLS 364. 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 346 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 402 

Query: 3 65 ISn^NLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 403 WNLPGCNGGSLYVNKHAFYTPKFDRISFRN^ 461 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNM^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 462 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ 521 



>gi | 9635157 | ref |NP_058422 . 1 | replicase [Transmissible gastroenteritis 
virus] 

gi | 7801348 | emb| CAB91143 .1 | replicase [Transmissible gastroenteritis virus] 
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Length = 6685 
Score = 545 bits (1403), Expect = e-153 

Identities = 261/484 (53%), Positives = 335/484 (69%), Gaps = 13/484 (2%) 

Query: 4 KDMTYRRLISMMGFKMNYQWGYPNM^ 63 

KD+ Y +IS MGF+ + GY +F TR+ A+R+VRAW+GFDVEG H D VGTN+P 
Sbjct: 5574 KDVKYANVT S YMGFRFEANI PGYHTLFCTRDFAMRISrVRAWLGFDVEGAHVCGDWGTWP 5633 

Query: 64 LQLGFSTGWLVAVPTGYVDTEI^ 123 

LQLGFS GV+ V G V TE V A+ PPG+QF HLIPLM KG PW++VR +1 

Sbjct: 5634 LQLGFSNGVDFWQTEGCVITEKGNSIEVW 5693 

Query: 124 VQMLSDTLKGLSDRVWVLWAHGFELTSMKYFVKIG-PERTCCLCDKRATCFSTSSDTYA 182 

VQM+ D GLSD ++FVLWA G ELT+M+YFVKIG P++ C C K ATC+S+S YA 
Sbjct: 5694 VQMVCDYFDGLSDILIFVLWAGGLELTTMRYFVKIGRPQK — CECGKSATCYSSSQSVYA 5751 

Query: 183 CWlSfHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVH 242 

C+ H++G DY+YNP+ ID+QQWG+TG+L NH + C +H N HVAS DAIMTRCLA+H+C 
Sbjct: 5752 CFKHALGCDYLYNPYCIDIQQWGYTGSLSMNHHEVCNIHRNEHVASGDAIMTRCLAIHDC 5811 

Query: 243 FWRVDWSVEYPIIGDELRWSACRKVQHMVWSALLADKFPVLHDIGNPKAIKCVPQAE 302 

FVKRVDWS+ YP I +E ++N A R VQ V+K+AL +HD+GNPK I+C 

Sbjct: 5812 FWRVDWSIWPFIDNEEKINKAGRIVQSHVMKAALKIFNPAA^ 5870 

Query : 303 VEWKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRV 3 62 

+ W YD. P ++ + L Y Y +H +G+ LFWNCNVD YP +IVCRFDTR 

Sbjct: 5871 I PWFC YDRDP IMF NVRCLDYDYMVHGQ--MNGLMLFWNCNVDMYPEFSIVGRFDTRT 5925 

Query: 363 LSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDY 422 

S L+L GC+GG+LYVN HAFHTPA+D+ AF LK +PFFYY DS CE V +Y 

Sbjct: 5926 RSKLSLEGCNGGALYVNNHAFHTPAYDRRAFAKLKPMPFFYYDDSNCE LVDGQPNY 5981 

Query: 423 VPLKS ATC ITRCNLGGAVCRHHANEYRQYLDAYNMMI S AGFSLWI YKQFDTYNLWNTFTR 482 

VPLKS CIT+CN+GGAVC+ HA" YR Y++ YN+ + AGF++W + FDTY LW+ F 
Sbjct: 5982 VPLKSWCITKCNIGGAVCKKHAALYRAYVEDYNIFMQAGFTIWCPQNFDTYMLWHGFVN 6041 

Query: 483 LQSL 486 
++L 

Sbjct: 6042 SKAL 6045 



>gi 
gi 



19387582 
13752450 



ref |NP_598309 . 1 | Poll [porcine epidemic diarrhea virus] 
gb| AAK38661 . 1 | Poll [porcine epidemic diarrhea virus] 



Length = 6781 



Score = 541 bits (1394), Expect = e-152 

Identities = 256/480 (53%), Positives = 334/480 (69%), Gaps = 12/480 (2%) 

Query: 8 YRRLISMMGFKJyDN^ 67 

Y +IS MGF+ + + + +F TR+ A+R+VR W+GFDVEG H VGTN+PLQLG 
Sbjct: 5675 YEHVISFMGFRFDINIPNHHTLFCTRDFAMRIWRGWLGFDVEGAHW^ 5734 

Query: 68 FSTGWLVAVPTGYVDTENOT 127 

FS GV+ V P G V TE+ V A+ PPG+QF HL+PL+ +G PW+WR +IVQM 

Sbjct: 5735 FSNGVDFWRPEGCWTESGDYIKPVRARAPPGEQFAHLLPLLKRGQPWDWRKRIVQMC 5794 
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Query: 128 SDTLKGLSDRVWVLWAHGFELTSMKYFWIGPERTCCLCDKI^TCFSTSSDTYACWNHS 187 

SD L LSD ++FVLWA G ELT+M+YFVKIGP ++C C K ATC++++ TY C+ H+ 
Sbjct: 5795 SDYLANL SDI L I F VLWAGGLELTTMRYF VKI GPSKS CD - CGKVATC YNS ALHTYCCFKHA 5853 

Query : 188 VGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRV 247 

+G DY+YNP+ ID+QQWG+ G+L NH +HC VH N HVAS DAIMTRCLA+H+CFVK V 
Sbjct: 5854 LGCDYLYNPYCIDIQQWGYKGSLSLNHHEHCNVHRNEHVASGDAIMTRCLAIHDCFV^ 5913 

Query : 248 DWSVEYPI IGDELRVNS ACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKF 307 

DWS+ YP IG+E +N + R VQ ++S L ++DIGNPK I+C + +W 

Sbjct: 5914 DWSITYPFIGNEAVINKSGRIVQSHTMRSVLKLY^ 5972 

Query : 308 YDAQPCSDKAYKIEELF YSYAIHHDKFTDGVCLFWNCNVDRYPANAI VCRFDTRVLSNLN 367 

+D P + +E Y Y I H +F DG+CLFWNCNVD YP ++VCRFDTR S LN 
Sbjct: 5973 FDKNPTNSNVKTLE YDY - 1 THGQF - DGLCLFWNCNVDMYPEF S WCRFDTRCRS PLN 6027 

Query: 368 LPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPLKS 427 

L GC+GGSLYVN HAFHTPAFDK AF LK +PFF+Y D+ C+ ++ I+YVPL++ 
Sbjct: 6028 LEGCNGGSLYVNNHAFHTPAFDKRAFAKLKPMPFFFYDDTECD KLQDSINYVPLRA 6083 

Query: 428 ATCITRCNLGGAVCRHHANEYRQYLDAYTXim 486 

+ CIT+CN+GGAVC H Y Y++AYN SAGF++W+ FDTYNLW TF+ LQ L 
Sbjct: 6084 SNCITKCWGGAVCSKHCAMYHSYWAYNTFTSAGFTIWPTSFDTYNLWQTFSNNLQGL 6143 



>gi | 12175747 | ref | NP_073549 . 1 1 replicase polyprotein lab [Human coronavirus 
229E] 

gi|l2082740|gb|AAG48591.l|AF304460_2 replicase polyprotein lab [Human 
coronavirus 229E] : 

Length =6758 

Score = 535 bits (1379), Expect = e-151 

Identities = 254/478 (53%), Positives = 329/478 (68%), Gaps = 13/478 (2%) 

Query: 7 TYRRLISMMGFKMNYQWGYPNMF 66 

TY +IS MGF+ + + G ++F TR+ A+RHVR W+G DVEG H T D VGTN+PLQ+ 
Sbjct: 5642 TYEHVISYMGFRFDVSMPGSHSLFCTRDFAMRHWGWLGMDVEGAHVTGDWGTWPLQV 5701 

Query: 67 GFSTGWLVAVPTGYVDTENNTEFTRWAK^ 126 

• GFS GV+ VA P G V T + V A+ PPG+QF H++PL+ KG PW+V+R +IVQM 
Sbjct: 5702 GFSNGVDFVAQPEGCVLTNTGSVWPVRARAPPGEQFTHIVPLLRKGQPWSVLRKRIVQM 5761 

Query: 127 LSDTLKGLSDRWFVLWAHGFELTS]VCKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNH 186 

++D L G SD +VFVLWA G ELT+M+YFVKIG + C C ATC+++ S+ Y C+ H 
Sbjct: 5762 I ADFL AGS SDVLVF VLWAGGLELTTMRYF W 5820 

Query: 187 SVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKR 246 

++G DYVYNP++ID+QQWG+ G+L +NH C VH N HVAS DAIMTRCLAV++CFVK 
Sbjct: 5821 ALGCDYVYNPWIDIQQWGWGSLSTNHHAICISTVHRNEHVASGDAIM 5880 

Query: 247 VDWSVEYPIIGDELRWSACRKVQHWVKSALLADKFPVL 306 

VDWS+ YP+I +E +N R VQ ++++A+ +HDIGNPK I+C + +W 

Sbjct: 5881 VDWSITYPMIANENAINKGGRTVQSHIMRAAIKLYNPKAIHDIGNPKGIRCA-WDA^^ 5939 

Query: 307 FYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSNL 3 66 
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YD P + +E Y Y H DG+CLFWNCNVD YP +IVCRFDTR S L 

Sbjct: 5940 C YDKNP I NSNVKTLE YDYMTHGQ - -MDGLCLFWNCNVDMYPEF S I VCRFDTRTRSTL 5994 

Query: 367 NLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSD-IDYVPL 425 

NL G +GGSLYVN HAFHTPA+DK A LK PFFYY D CE WD ++YVPL 

Sbjct: 5995 NLEGWGGSLYVlSnSTHAFHTPAYDKRAMAKLKPAPFFYYDDGSCE WHDQVNYVPL 6049 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYl^ 483 

++ CIT+CN+GGAVC HAN YR Y+++YN+ AGF++W+ FD YNLW TFT + 
Sb j c t : 6050 RATNCITKCNIGGAVCSKHANLYRAYVESYNIFTQAGFNI WVPTTFDCYNLWQTFTEV 6107 



>gi 13359l|sp Pi 8 4 5 8 | RRPB__BEV RNA-directed RNA polymerase (ORFlB) 
gi 94017 | pir |S11238 polymerase - Berne virus 

gi 1334814|emb|CAA36601.l| 2nd polymerase reading frame (AA 1-2291) [Berne 
virus] 

Length = 2291 
Score = 50.1 bits (118), Expect = 8e-05 

Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 11/103 (10%) 

Query: 140 FVLWAHGFELTS3y[KYFVKIGPERTC--CLCDKRATCFSTSSDTYACWNHSVGF — DYVYN 195 

F+L4-+ +L S+K++V+ TC C C + A C + Y C N G + N 

Sbjct: 1511 F I L YS C SNDLKS LKF YVEFD TCYFCSCGEMAICLMRDGN-YKCRNCYGGMLISKLVN 1566 

Query: 196 PFMIDVQQWGFTGNLQSNHDQHC-QVHGNAHVASCDAIMTRCL 237 

+DVQ+ LQ HD C Q HG++H A CDA+MT+CL 

Sbjct: 1567 CKYLDVQKERV--KLQDAHDAICQQFHGDSHEALCDAVMTKCL 1607 

>gi | 1513061 |dbj |BAAl3323.l| cyanoprotein alpha subunit precursor [Riptortus 
clavatus] 

Length = 693 
Score = 34.7 bits (78), Expect = 3.7 

Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%) 

Query: 371 CDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSD 406 

C G LY +KHA P FD+ A+ + Q+P FY+ D 
Sbjct: 643 CGGSKLYDSKHAMGFP-FDRPAYPDAFQVPNFYFKD 677 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

Posted date: Apr 11, 2003 2:30 AM 
Number of letters in database: 454,141,287 
Number of sequences in database: 1,411,415 

Lambda K H 

0.325 0.139 0.456 



Gapped 

Lambda K H 

0.267 0.0410 0.140 
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Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 473,361,261 

Number of Sequences: 1411415 

Number of extensions: 20503315 

Number of successful extensions: 51018 

Number of sequences better than 10.0: 27 

Number of HSP's better than 10.0 without gapping: 26 

Number of HSP's successfully gapped in prelim test: 1 

Number of HSP's that attempted gapping in prelim test: 50937 

Number of HSP's gapped (non-prelim): 33 

length of query: 486 

length of database: 454,141,287 

effective HSP length: 127 

effective length of query: 359 

effective length of database: 274,891,582 

effective search space: 98686077938 

effective search space used: 98686077938 

T: 11 

A: 40 

XI: 15 ( 7.0 bits) 
X2: 38 (14.6 bits) 
X3: 64 (24.7 bits) 
SI: 40 (21.6, bits) 
S2: 75 (33.5 bits) 
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FIGURE 125A 
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FIGURE 125B 
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FIGURE 126 

5' 3' Frame 1 

QVHQNVCVL- L IFYLMTLSR- - SHKICQ-FQKWSRLQLTMLKFHSCFGVRMDMLKPSTQN 

YKQVKRGNQVLRCLTCTRCKECFLKSVTFRIMVKMLLYQKE MSQSILNCVNT-IHLL 

-LYPPT-ELFTLVL 

5'3' Frame 2 

RFIKMCVFCD-SFT — LCRDNKVTRFVSDFKSGQGYN-LC-NFIHALV-GWTC-NLLPKT 
TSKSSVATRCCDA-LVQDAKNAS-KV-PSELW-KCCYTKRNNDECRKVYSTVSILKYTYF 
SCTLQHESYSLWCW 

5" 3' Frame 3 

GSSKCVCSVIDLLLDDFVEI IKSQDLSVI SKWKVTIDYAEI SFMLWCKDGHVETFYPKL 
QASQAWQPGVAMPNLYKMQRMLLEKCDLQOTGENAVIPKGIMMWAKYTQLCQYLNTLTL 
AVPSNMRVI HFGAG 

3 ! 5' Frame 1 

PAPK-ITLMLEGTAKVSVFKY-HS-VYFATFIIIPFGITAFSP-F-RSHFSRSILCILYK 
LGIATPGCHA-LACSFG-KVSTCPSLHQSMNEISA-SIVTLTTFEITDKSCDFIISTKSS 
SKRSITEHTHFDEP 

3 '5' Frame 2 

QHQSE-LSCWRVQLK-VYLSIDTVEYTLRHSSLFLLV-QHFHHNSEGHTFQEAFFASCTS 
-ASQHLVATLDLLWLGRRFQHVHPYTKA-MKFQHSQL- P- PLLKSLTNLVTLLSRQSHQ 
VKDQSQNTHILMNL 

3' 5' Frame 3 

STKVNNSHVGGYS-SKCI-VLTQLSILCDIHHYSFWYNSIFTIILKVTLFKKHSLHLVQV 
RHRNTWLPRLTCL-FWVEGFNMS ILTPKHE -NFS IVNCNLDHF -NH- QIL- LYYLDKVIK 
-KINHRTHTF-T 
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FIGURE 127 

5*3' Frame 1 

-VFTYPGKANQPRSLVDLFSKRTN-NV- -WTPIKPT-CPPHYIWWTHRFN-Q- PEWRTAM 
GQGQNSADPKVYPIILRLGSQLSLSMARRNLDSLEARAFQSTPIWQMTKLATTEELPDE 
FVWTAK- KS SAPDGTS IT - ELAQKLHFPTALTKKASYGLQLRE P - IHPKTTLAPAI LIT 
MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHWAVIQEIQLLAAVGEILLLEWLAE 
WKLPSRYCC-TD-TSLRAKFLVKA]SnMKAKLSLRNLLLRHLKSLAKNVLPQNSTTSLKH 
LGDWQNKPKEISGTKT-SDKELITNIGPQIAQFA 

5' 3" Frame 2 

RFLPTQEKPTNLDLL-ICSLNEQIKMSDNGPQSNQRSAPRITFGGPTDSTDMNQNGGLQW 
GKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-PNWLLPKSYPTS 
SWW-RQNERAQPQMVLLLPRNWPRSFTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS--Q 
CCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RR 
W-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHKTVQRHSSI 
WETWSRTNPRKFRGPRPNQTRN-LQTLGRKLHNLP 

5 '3' Frame 3 

GFYLPRKSQPTSISCRSVL-TNKLKCLIMDPNQTNWPPALHLVDPQIQLTITRMEDCNG 
ARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRV 
RGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWATEGALNTPKDHIGTRNPMSIN 
AATVLQLPQGTTLPKGFYAEGSRGGSQAS SRS S SRSRGNSRNSTPGS SRGNS PARMASGG 
GETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAF 
GRRGPEQTQGNFGDQDLIRQGTDYKHWAANCTIC 

3 '5' Frame 1 

RQ I VQFAAQCL - S VPCLIRSWS PKF PWVC SGPRL PNA- VTLYC FVAVRFWRGFLDAS AAD 
FLVTVWPCCCWPLPETLLSSWFNLSSSNSARAVSPPPLAIRAGEFPLLLPGVEFLELPRL 
RDEEREEA-LPPLLPSA-KPFGNWP-GSCSTVAALLLGLRVPMWSLGVFKAPSVATHTM 
PSLLAP-GSEASGPVPR--KYHLGLSSFILPSPPRTRRVALR--PIWSSGPLLVLIGTPW 
PRGNLSSSLPC-VRAVNQDAILLGKPWGRRCFGLAPLQSSILVIVS-ICGSTKCNAGGTT 
LV-LGSIIRHFNLFV-RTDLQEIEVGWLFLGR-KP 

3' 5' Frame 2 

GKLCNLRPNVCNQFLV- LGLG PRNFLGF VLDHVS QMLE - RCTVLWQYVFGEAF - MPQQQ I 
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S — QFGLVWGLYQKLCSQAGS ICLAAIARGQFHHLR-PFEQENFPYCCQELNFLNYRDY 
VMRSEKRLDCRLCFPLRRSLLAMLFLEEWARWQHCY-DCGCQCGLWVYSRLPQLQPIRC 
LLC-RRREVKLLGQFLGNRSTIWG-ALSFCRHHHELVG-LFGSSQFGHLDHYWC-LERPG 
LEGI-VPPCHAE-EL-TKTQYYWVNLGVGAVLALPHCSPPFWLLSVESVGPPNVMRGALR 
WFDWGPLSDILICSFREQIYKRSRLVGFSWVGKNL 

3' 5" Frame 3 

ANCAICGPMFVISSLSD-VLVPEISLGLFWTTSPKCLSDWLFCGSTFLARLFRCLSSRF 
LSDSLALLLLAFTRNFALKLVQSV-QQ-REGSFTTSASHSSRRISPTAARS-IS-ITATT 
— GARRGLTAASASLCVEAFWQCCSLRKL-HGGSIVIRIAGANWFGCIQGSLSCNPYDA 
FFVSAVGK-SFWASS-VIEVPSGAELFHFAVTTTNSSGSSSWANLVIWTTIGVDWNALA 
SRESKFLLAMLSESCEPRRNIIG-TLGSALFWPCPIAVLHSGYCQLNLWVHQM-CGGHYV 
GLIGVHYQTF - FVRLENRSTRDRGWLAFPG- VKT 



FIGURE 128 

-GLELKL-LTSICAF-PFCYSLF — CLLYFGFHSKSRI-KNLVPKSKRT-NFSLF-LVFL 
YAVAYAL-YSAVHLINLMCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTWQTCTPN 
VTINCQDPAGGALIARCWYLHEGHQTAAFRDVLWLNKRTN-NV- -WTPIKPT-CPPHYI 
WWTHRFN-Q-PEWRTQWGKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ 
; -WSR-PISMLLPKSYPTSSM-RQNERAQPQWLLLPPJMPRSFTSLRR-QRRHRMGCN-G 
SLEYTQRPHWHPQS--QCCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKF 
NSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-K 
ASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPNQTRN-LQTLAANCTICSKCLCILWN 
VTHWHGSHTFGNMADLSWSH-IG-QRSTIQRQRHTAEQAH-RIQNIPTNRA-KGQKEKD- 
-SSAFAAETKEAAHCDSSSC 

EDSSSSFN-LLFVLFSLSAIPCFNNAYYILVFTRNPGSRRTLYQSLNEHETSHCFDLYFS 
MQLHMHCSTALCI — TSCA-RSL-GTTLGVILIALLGFVL-ERFYLFIDGTLWFKHAHLM 
LLSTVKIQLWRL-LGVGTFMKVTKLLHLETYLLF-INEQIKMSDNGPQSNQRSAPRITF 
GGPTDSTDNNQNGGRNGARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTN 
SGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEG 
ALNTPKDHIGTRNPNNNAATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNS 
TPGSSRGNSPARMASGGGETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKK 
PRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGM 
SRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTD 
EAQPLPQRQKKQPTVTLLP 
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RTRAQALIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV-TNMKLLIVLTCISL 
CSCICTWQRCASNKPHVLEDPCKVQH-G-YL-HCLALCSRKGFTFS-MAHYGSNMHT-C 
YYQLSRS SWWCAYS - VLVPS -RS PNCC I -RRTCCFK-TNKLKCLIMDPNQTNWPPALHL 
VDPQIQLTITRMEDAMGQGQNSADPKVYPIILRLGSQLSLSMARRNLDSLEARAFQSTPI 
WQMTKLATTEELPDEFWVTAK-KSSAPDGTSIT-ELAQKLHFPTALTKKASYGLQLRE 
P-IHPKTTLAPAILITMLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHWAVIQEIQ 
LLAAVGEILLLEWLAEWKLPSRYCC-TD-TSLRAKFLVKANMIKAKLSLRNLLLRHLKS 
LAKNVLPQNSTTSLKHLGDWQNKPKEISGTKT-SDKELITNIGRKLHNLLQVPLHSLEC 
HALAWKSHLREHG-LIMEPLNWMTKIHNSKTTSYC-TSTLTHTKHSHQQSLKRTKRKRLM 
KLSLCRRDKRSSPL-LFFL 



FIGURE 129 

5' 3' Frame 1 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccct 

YRRLI SMMGFKMNYQVNGYP 
aatatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgat 

NMFITREEAIRHVRAWIGFD 
gtagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctagga 

V' EGCHATRDAVG TN L PLQLG 
ttttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataac 

F S T G V N L VAV P T G Y V D T E N N 
acagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

TEFTRVNAKPPPGDQFKHLI 



5 • 3 • Frame 2 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttacccta 

TVDSSL-WVSK-ITKSMVTL 
atatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatg 

ICLSPAKKLFVTFVRGLALM 
tagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggat 

-RAVMQLEMLWVLTYLSS-D 
tttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataaca 

FLQVLT - - LYRLVMLTLKIT 
cagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

QNSPELMQNLHQVTSLNILY 
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5' 3' Frame 3 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccctaa 

P -THLYDGFQNELP'S QWL P - 
tatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatgt 

YVYHPRRSYSSRSCVDWL-C 
agagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggatt 

RGLSCN-RCCGY-PTSPARI 
ttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataacac 

FYRC-LSSCTDWLC-H-K-H 
agaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

RIHQS-CKTSTR-PV-TSYT 



3 '5' Frame 1 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattct 

G.IRCLNWS PGGGFALTLVNS 
gtgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaa 

VLFSVST- PVGTATKLTPVE 
aatcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctct 

N P S W R G RLVPTAS LVA - Q P S 
acatcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacata 

TSKPIHART-RIASSRVINI 
ttagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

LG-PLTW-FILKPIIEMSLR 



3 '5' Frame 2 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctg 

V-DV-TGHLVEVLH-LW-IL 
tgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaa 

CY'FQC QHNQ SVQLL S - HL - K 
atcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctcta 

ILAGEVG-YPQHL-LHDSPL 
catcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatat 

HQSQSTHERDE-LLRG - - TY 
tagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

-GNH-LGNSF-NPS-R-VYG 
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3' 5' Frame 3 



ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgt 

YKMFKLVTWWRFCINSGEFC 
gttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaa 

VIFSVNITSRYSY.-VNTCRK 
tcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctac 

S-LER-VSTHSISSCMTALY 
atcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatatt 

IKANPRTNVTNSFFAGDKHI 
agggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

RVTIDLVIHFETHHRDESTV 



FIGURE 130 

10 20 30 40 50 60 



SEQ ID NO: 9997 KGHDLRRL I SMMGFKMNYQVNGYPNMF I TREEAIRHVRAWI GFDVEGCHATRDAVGTNLP 

SEQ ID NO: 10034 YRRLISMGFKiyiNYQWGYPNMFITREEAIRHVMWIGFDVEGCHATRDAVGTNLP 

******************************************************* 

Prim. Cons. ' KGHD2RRLISMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 

70 80 90 100 110 120 



SEQ ID NO: 9997 LQLGFSTGVNLVAVPTGYVDTENNTKFTRVNAQTSTSEQFKHLIPLMYKGLPWNWRIKI 

SEQ ID NO: 10034 LQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

*************************.******. .****** 

Prim. cons. LQLGFSTGVNLVAVPTGYVDTENNT2 FTRVNA2 2222 2QFKHLI PLMYKGLPWNWRIKI 

130 140 150 160 170 180 



SEQ ID NO: 9997 VQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 
SEQ ID NO: 10034 

Prim, cons . VQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 

190 200 

I I 
SEQ ID NO: 9997 WHSVGFDYVYNPFMIDVQQWGLYG 

SEQ ID NO: 10034 
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FIGURE 131 

5' 3' Frame 1 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgaga 

QVHQNVCVL-LIFYLMTLSR 
taataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatg 

--SHKICQ-FQKWSRLQLTM 
ctgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaac 

LKFHSCFGVRMDMLKPSTQN 
tacaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaa 

YKQVKRGNQVLRCLTCTRCK 
gaatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaag 

ECFLKSVTFRIMVKMLLYQK 
gaataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttactt 

E---MSQSILNCVNT-IHLL 
tagctgtaccctccaacatgagagttattcactttggtgctgg 

-lyppt-elftlvl" 



5 '3' Frame 2 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagat 

RFIKMCVFCD - S FT- - LCRD 
aataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgc 

NKVT RFVSDF KSGQGYN-LC 
tgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaact 

-NF I H A L V - GWT C - N LLPKT 
acaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaag 

TSKS SVATRCCDA-LVQDAK 
aatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagg 

NAS-KV-PSELW-KCCYTKR 
aataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttt 

NNDECRKV YSTVS ILKYTYF 
agctgtaccctccaacatgagagttattcactttggtgctgg 

SCTLQHESYSLWCW 



5 • 3 1 Frame 3 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagata 

GSSKCVCSVIDLLLDDFVEI 
ataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgct 

IKSQDLSVISKVVKVTIDYA 
gaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaacta 

EI SFMLWCKDGHVETFYPKL 
caagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaaga 
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QASQAWQPGVAMPNLYKMQR 
atgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagga 

MLLEKCDLQNYGENAVIPKG 
ataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttta 

IMMNVAKYTQLCQYLNTLTL 
gctgtaccctccaacatgagagttattcactttggtgctgg 

AVP SNMRVIHFGA 



3" 5' Frame 1 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaag 

PAPK - ITLMLEGTAKVSVFK 
tattgacacagttgagtatactttgcgacattcatcattattccttttggtataacagca 

Y-HS-VYFATFIIIPFGITA 
ttttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaag 

FSP-F-RSHFSRSILCILYK 
ttaggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtt 

LGIATPGCHA-LACSFG-KV 
tcaacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacc 

STC PS L HQ SMN E I S A - S IVT 
ttgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatca 

LTTFEITDKSCDFIISTKSS 
agtaaaagatcaatcacagaacacacacattttgatgaacctg 

SKRSITEHTHFDEP 

3" 5' Frame 2 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagt 

QHQSE-LSCWRVQLK-VYLS 
attgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcat 

IDTVEYTLRHSSLFLLV-QH 
tttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagt 

FHHNSEGHTFQEAFFASCTS 
taggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggttt 

-ASQHLVATLDLLVVLGRRF 
caacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacct 

QHVHPYTKA-MKFQHSQL-P 
tgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaa 

-PLLKSLTNLVTLLSRQSHQ 
gtaaaagatcaatcacagaacacacacattttgatgaacctg 

VKDQSQNTHILMNL 



3' 5' Frame 3 
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ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagta 

STKVNNSHVGGYS-SKCI-V 
ttgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcatt 

LTQLSILCDIHHYSFWYNSI 
ttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagtt 

FTIILKVTLFKKHSLHLVQV 
aggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtttc 

RHRNTW'LPRLTCL - FWVEGF 
aacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacctt 

NMSILTPKHE-NFSIVNCNL 
gaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaag 

DHF-NH-QIL-LYYLDKVIK 
taaaagatcaatcacagaacacacacattttgatgaacctg 

-KINHRTHTF - - T 



FIGURE 132 

5'3' Frame 1 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctct 

-VFTYPGKA NQ PRSLVDLF S 
aaacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccg 

KRTN-NV - - WTPIKPT-CPP 
cattacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatg 

HYIWWTHRFN-Q-PEWRTAM 
gggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttca 

GQGQNSADPKVYPIILRLGS 
cagctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaa 

QLSLSMARRNLDSLEARAFQ 
tcaacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgag 

STPIVVQMTKLATTEELPDE 
ttcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctag 

FVVVTAK-KSSAPDGTSIT- 
gaactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttg 

ELAQKLHFPTALTKKASYGL 
caactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataaca 

QLREP-IHPKTTLAPAILIT 
atgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcag 

MLPPCYNFLKEQHCQKASTQ 
agggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaatt 

REAEAAVKPLLAPHHVVAVI 
caagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggag 

QEIQLLAAVGEILLLEWLAE 
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gtggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaag 

VVKLPSRYCC-TD-TSLRAK 
tttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgagg 

PLVKANNNKAKLSLRNLLLR 
catctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcat 

HLKSLAKNVLPQNSTTSLKH 
ttgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagac 

LGDVVQNKPKEISGTKT-SD 
aaggaactgattacaaacattgggccgcaaattgcacaatttgcct 

KELITNIGPQIAQFA 



5*3* Frame 2 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctcta 

RFLPTQEKPTNLDLL-ICSL 
aacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgc 

NEQIKMSDNGPQSNQRSAPR 
attacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatgg 

ITFGGPTDSTDNNQNGGLQW 
ggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcac 

GKAKTAPTPRFTQ-YCVLVH 
agctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaat 

SSHSAWQGG T-IPSRPGRS N 
caacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagt 

QH,Q-WSR- PNWLLPKSYPTS 
tcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagg 

SWW-RQNERAQPQMVLLLPR 
aactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgc 

NWPRSFTSLRR-QRRHRMGC 
aactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaa 

N- GSLEYTQRPHWHPQS- - Q 
tgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcaga 

CCHRATTSSRNNIAKRLLRR 
gggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattc 

GKQRRQSSLFSLLIT-SR-F 
aagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggagg 

KKFNSWQQ-GKFSCSNG-RR 
tggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagt 

W-NCPRAIAARQIEPA-EQS 
ttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggc 

FW-RPTTTRPNCH-EICC-G 
atctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcatt 

I - KASPKTYCHKTVQRHS S I 
tgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagaca 

WETWSRTNPRKFRGPRPNQT 
aggaactgattacaaacattgggccgcaaattgcacaatttgcct 

RN-LQTLGRKLHNLP 
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5 '3' Frame 3 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctctaa 

GFYLPRKSQPTSISCRSVL- 
acgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgca 

TNKLKCLIMDPNQTNVVPPA 
ttacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatggg 

lhlvdpqiqltitrmedcn'g 

gcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcaca 

ARPKQRRPQGLPNNIASWFT 
gctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaatc 

ALTQHGKEELRFPRGQGVPI 
aacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagtt 

NTNSG PDDQ IGYYRRATRRV 
cgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagga 

RGGDG 'KMKELSPRWYFYYLG 
actggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgca 

TG PEASLPYGANKEGIVWVA 
actgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaat 

TEGALNTPKDHIGTRNPNNN 
gctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcagag 

AATVLQLPQGTT- LP'KGFYAE 
ggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattca 

GSRGGSQASSRSSSRSRGNS 
agaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggaggt 

RNSTPGSSRGNSPARMASG G 
ggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagtt 

GETALALLLLDRLNQLESKV 
tctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggca 

SGKGQQQQG QTVTKKSAAEA 
tctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcattt 

SKKPRQKRTATKQYNVTQAF 
gggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagacaa 

GRRGPEQTQGNFGDQDLIRQ 
ggaactgattacaaacattgggccgcaaattgcacaatttgcct 

GTDYKHWAANCTIC 



3'S 1 Frame 1 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtct 
RQIVQFAAQCL-SVPCLIRS 

tggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacg 
WSPKFPWVCSGPRLPNA-VT 

ttgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagat 
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LYCFVAVRFWRGFLDASA AD 
ttcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagc 

FLVTVWPCCCWPLPETLLSS 
tggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccatt 

WFNLSSSNSARAVSPPPLAI 
cgagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgacta 

RAGEFPLLLPGVEFLELPRL 
cgtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcct 

RDEEREEA-LPPLLPSA-KP 
tttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattg 

FGNVVP-GSCSTVAALLLGL 
cgggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatg 

RVPMWSLGVFKAPSVATHTM 
ccttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaag 

PSLLAP-GSEASGPVPR--K 
taccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagct 

YHLGLSSFILPSPPRTRRVA 
cttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctgg 

LR - - PIWSS GPLLVL IGTPW 
cctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgca 

PRGNLSSSLPC-VRAVNQDA 
atattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcc 

ILLGKPWGRRCFGLAPLQSS 
attctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacg 

I L V I V S - ICGST KCNAGGTT 
ttggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatcta 

LV-LGSIIRHFNLFV-RTDL 
caagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

Q EIEVGWLFLGR-K.P 



3' 5' Frame 2 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtctt 

GKLCNLRPNVCNQFLV-LGL 
ggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgt 

GPRNFLGFVLDHVSQMLE-R 
tgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagatt 

CTVLWQYVFGEAF-MPQQQI 
tcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagct 

S - - QFGLVVVGLYQKLCSQA 
ggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattc 

GSICLAAIARGQFHHLR-PF 
gagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactac 

EQENFPYCCQELNFLNYRDY 
gtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcctt 

VMRSEKRLDCRLCFPLRRSL 
ttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgc 
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LAMLFLEEVVARWQHCY-DC 
gggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgc 

GCQCGLWVYS RLPQLQPIRC 
cttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagt 

LLC-RRREVKLLGQFLGNRS 
accatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctc 

TIWG-ALSFCRHHHELVG-L 
ttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggc 

FGSSQFGHLDHYWC-LERPG 
ctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaa 

LEGI - VPPCHAE-EL-TKTQ 
tattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcca 

YYWVNLGVGAVLAL PHC S PP 
ttctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgt 

FWLLSVESVGPPNVMRGALR 
tggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctac 

WFDWGP LSDILIC S FREQ IY 
aagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

K. R SRLVGFSWVGKNL 



3 '5' Frame 3 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtcttg 

A NCAICGPMFV ISSLSD-VL 
gtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgtt 

VPEISLGLFWTTSPKCLSDV 
gtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagattt 

VLFCGSTFLARL FRCL SSRF 
cttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagctg 

LSDSLALLLLAFTRNFALKL 
gttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattcg 

VQSV - QQ-REGSFTTSASHS 
agcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactacg 

SRRISPTAARS-IS-ITATT 
tgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagccttt 

--GARRGLTAASASLCVEAF 
tggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgcg 

WQCCSLRKL-HGGSIVIRIA 
ggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgcc 

GANVVFGCIQGSLSCNPYDA 
ttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagta 

FFVSAVGK-SFWASS-VIEV 
ccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctct 

PSGAEL FHFAVTTTNS SGSS 
tcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggcc 

SVVANLVIWTTIGVDWNALA 
tcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaat 
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SRESKFLLAMLSESCEPRRN 
attattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctccat 

IIG-TLGSALFWPCPIAVLH 
tctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgtt 

SGYC QLNLWVHQM-CGGHYV 
ggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctaca 

GLIGVHYQTF-FVRLENRST 
agagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

RDRGWLAFPG-VKT 
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FIGURE 133 
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FIGURE 134 
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